Re: BSOD
Thanks!
We have three attached DMP files, all of the same bug check:
All of the attached DMP files are of the
WHEA_UNCORRECTABLE_ERROR (124) bugcheck.
A fatal hardware error has occurred. This fatal error displays data from the Windows Hardware Error Architecture (WHEA).
If we run an !errrec on the 2nd parameter of the bugcheck (address of the WER structure) we get the following:
Code:
===============================================================================
Section 0 : Processor Generic
-------------------------------------------------------------------------------
Descriptor @ fffffa80086380a8
Section @ fffffa8008638180
Offset : 344
Length : 192
Flags : 0x00000001 Primary
Severity : Fatal
Proc. Type : x86/x64
Instr. Set : x64
[COLOR=Red][I][B]Error Type : Cache error[/B][/I][/COLOR]
[COLOR=Red][I][B]Operation : Instruction Execute[/B][/I][/COLOR]
Flags : 0x00
Level : 3
CPU Version : 0x0000000000600f20
Processor ID : 0x0000000000000006
So we can see it's a Cache Error, specifically the operation was an Instruction Execute. As far as I know, a microproccesor typically breaks the execution of an instruction into smaller operations. Each of these operations completes a portion of the entire instruction's execution.
If we go further...
Code:
===============================================================================
Section 2 : x86/x64 MCA
-------------------------------------------------------------------------------
Descriptor @ fffffa8008638138
Section @ fffffa80086382c0
Offset : 664
Length : 264
Flags : 0x00000000
Severity : Fatal
[COLOR=Red][I][B]Error : ICACHELG_IRD_ERR (Proc 6 Bank 1)[/B][/I][/COLOR]
Status : 0xb600000000100153
Address : 0x0000000000000007
Misc. : 0x0000000000000000
I have never seen documented information on what 'ICACHELG_IRD_ERR' is, but it's a type of CPU cache error. In this specific dump, it's occurring on Processor # 6 and Cache Bank # 1. On the other two dumps, it occurs on Processor # 7 and Cache Bank #1 (the same Cache Bank as the first dump).
There is only so much you can do with a bugcheck like this until it comes down to a faulty processor that will need to be replaced. Start from 1 and work downward:
1. Ensure your temperatures are within standard and nothing's overheating. You can use a program such as Speccy if you'd like to monitor temps -
Speccy - System Information - Free Download
2. Clear your CMOS (or load optimized BIOS defaults) to ensure there's no improper BIOS setting -
How To Clear CMOS (Reset BIOS)
3. Ensure your BIOS is up to date.
4. The
only software conflict that can usually cause *124 bugchecks are OS to BIOS utilities from manufacturer's like Asus' AI Suite. If you have something like this software-wise, remove it ASAP.
5. If all of the above fail, the only left to do is replace your processor as it is faulty.
Regards,
Patrick