Operating System : Microsoft Windows 8 Pro 64 bit
Age of Hardware : 1 week
Age of Installation OS : 1 week
Custom Build PC
CPU : Intel i7 4770k
Video Card : Asus Geforce GTX780 Direct CU II x 2 running SLI
Ram : Corsair 32 GB Vengeance Pro 1866
Motherboard : Asus Maximus VI Extreme
Power Supply : Corsair AX 1200i
The DPC_WATCHDOG_VIOLATION bug check has a value of 0x00000133. This bug check indicates that the DPC watchdog executed, either because it detected a single long-running deferred procedure call (DPC), or because the system spent a prolonged time at an interrupt request level (IRQL) of DISPATCH_LEVEL or above. The value of Parameter 1 indicates whether a single DPC exceeded a timeout, or whether the system cumulatively spent an extended period of time at IRQL DISPATCH_LEVEL or above.
DRIVER_OVERRAN_STACK_BUFFER (f7)
The stack memory was corrupted by a device driver. I see this bugcheck when there are usually attempts by malicious software to gain control of the system. However, it can be just a regular device driver issue as well.
If we look at the call stack, we can see nvlddmkm being called right before the crash which is the nVidia video driver.
Ensure you have the latest video card drivers. If you are already on the latest video card drivers, uninstall and install a version or a few versions behind the latest to ensure it's not a latest driver only issue. If you have already experimented with the latest video card driver and many previous versions, please give the beta driver for your card a try.
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred.
If we run an !errrec on the WER structure from the 2nd parameter, we get the following:
GCACHEL2_ERR_ERR implies there was a problem with your L2 cache, possibly a faulty processor or overheating. However I do note that you are using Asus utilities and these utilities create a window between the OS and the BIOS. These utilities could cause bugchecks like this to appear. We will need more *124 dumps to conclude this.
Ensure all of your temperatures are within standard and no overheating is occurring.
I see in you have Norton installed. I'd like to get all possible software issues out of the way, so I recommend temporarily removing Norton and replacing it with Windows 8's built-in Windows Defender for troubleshooting purposes:
DPC_WATCHDOG_VIOLATION (133) I really dunno what the above error mean.
DRIVER_OVERRAN_STACK_BUFFER (f7)
Before this i was running Asus GTX 780 Nvidia Reference design. In my windows Event Viewer i got plenty of nvlddmkm event 14 id error. At that time i'm running Nvidia GTX 780 launch driver which is Geforce 320.18 WHQL. I head over to Nvidia forum to check is there any other Geforce 700 series user face such problem? I saw alot of feedback thread over there is discussing this issue, where nvlddkmk event 14 id error, random BSOD, Crashed, Sluggish, Driver timeout detection and recovery.
I tried all the driver from 314.22WHQL (modded inf), 320.18 WHQL, 326.01 WHQL, 320.49 WHQL, 326.19 Beta, 326.29 Beta and 326.41 Beta. Where i notice 314.22 WHQL is the less nvlddkmk event 14 id error produce compare to 32x.xx series driver.
Using the Nvidia Geforce 32x.xx series driver sometime give me a complete lock up on my pc, forcing me to reset the pc via power/reset button and didn't produce any minidump files or BSOD.I start monitoring my Windows Event Viewer, when there's a sluggish/unresponsive desktop/gaming i notice there's a long list of nvlddkmk event 14 id pop up.
Thus, I suspect maybe is the both Asus Geforce GTX 780 having issue thus i replace both with new Asus Geforce GTX 780 Direct CU II, still the same issue occur.
I head to asus ROG forum to seek assistance, and ask to try the latest BIOS, still the same, finally i decided to swap another new Asus Maximus VI Extreme board and upgrade my power supply from Corsair HX850 Watt to Corsair AX 1200 Watt. Yet situation didn't improve. Run all my RAM stick using memtest for 24 hours without any error produce.All the above setting is stock factory and non overclock. I'm really clueless here beside praying the upcoming Nvidia WHQL driver can solve this issue.
WHEA_UNCORRECTABLE_ERROR (124) I'm using Corsair H80i cooling, I'm using Real Temp to monitor my temperature idle 28-32 Celsius when running benchmark prime 95/AIDA 64 65 - 70 Celsius while gaming usually around 50-55 Celsius.I'm sure there's no overheating occur in my CPU. While ROG forum is not recommend we use the Asus Ai Suite III software. Guess i will uninstall the software. The Asus Ai Booster software i couldn't find it come with my motherboard or download support from Asus website. This is a new Intel Z87 motherboard and getting software dated 2008 really puzzle me.
Back for reporting, 48 hours of usage (Unigine, 3dmark, Battlefield 3, Guild Wars 2, Assasin Creed 3, AIDA 64) with SLI disable i didn't encounter any BSOD or Nvlddkmk Event 14 error in my Windows Event Viewer. Using Geforce 314.22 WHQL driver. This is with Asus Ai Suite III and Norton 360 running. Guess the Nvidia driver 32x.xx version is giving GTX 780 SLI lotsa problem. Have to wait for a proper driver to fix this issue.
If you feel or are confident enough that your problem is solved, please navigate to 'Thread Tools' at the top of the thread and select 'Mark this thread as solved'.
I did 2 stability test using AIDA64 with intel turbo boost disable and 2 BSOD occur. I hope this new BSOD minidump can give me an insight on my hardware/software problem. I'm very confident to reproduce this BSOD with AIDA64 running if you need more minidump.
Personally I suspect the Corsair Vengeance Pro CMY16GX3M2A1866C9 memory module having compatibility problem with my motherboard. I tried to run 24 hours Memtest previously without error. I went to Asus website and check the latest memory module compatible list and the Corsair Vengeance Pro CMY16GX3M2A1866C9 doesn't appear in the list. Thus I already got myself Corsair Dominator Platinum CMD16GX3M4A1866C9 which appear in Asus Maximus VI Extreme memory module list but haven't put in the system yet. I hope this is not the Intel i7 4770k CPU problem. I hope you can assist me in advice and suggestion after you done analysis the minidump file.
Both attached dumps are of the *124 bugcheck and are still reporting L2 Cache errors.
Personally I suspect the Corsair Vengeance Pro CMY16GX3M2A1866C9 memory module having compatibility problem with my motherboard. I tried to run 24 hours Memtest previously without error. I went to Asus website and check the latest memory module compatible list and the Corsair Vengeance Pro CMY16GX3M2A1866C9 doesn't appear in the list.
I don't think it's the memory, but it could be. There are many instances in which a piece of hardware other than the processor itself could cause cache errors, but with cache errors it's usually the processor on its own more than anything else.
Also, those lists are called Qualified Vendors Lists (QVL). Those lists are simply a collection of RAM kits they happened to have available to test on their board. IN actuality just because a certain kit isn't on that list doesn't mean it's not compatible.
I would install the new RAM and leave everything at DEFAULT. Don't set the RAM to XMP, etc. Simply install the new RAM and I'd even go as far afterwards to clear the CMOS just to ensure everything is at standard setting.
If you crash again, it's likely the processor itself.
Thanks for the explanation on the QVL memory module. Will put in the Corsair Dominator Platinum Memory,clear the CMOS, Load BIOS on default setting, No XMP Profile,everything stock and will start run AIDA64 stability test. Will update you on progress. Thank you.
Okay, well, at this point to me it appears to be the processor. Memtest passes and with two different kits of RAM. No hard disk errors, nothing that would remotely point to being a video related issue, etc. I have never seen cache errors be caused by anything else, but it never hurts to be sure.
Just curious this kind of CPU faulty still can by pass BIOS QLED and Q Code check come with my motherboard?
Yeah, there are different definitions of hardware in terms of being 'faulty'. It can work but to a point, you know? Not all hardware once it stars going bad will simply stop working all together.
Thanks a lot Patrick, help me greatly understand the situation better right now. I will arrange a new CPU replacement. Hopefully I can get it by today or tomorrow cause when I visit the shop yesterday they told me i7 4770k is running low at the moment. When I got the new CPU, I will run some test and keep you posted.
Replaced my Intel i7 4770k earlier, cleared CMOS, Default Setting, Disable Intel Turbo Boost Technology, still running AIDA 64 stability stress test for 8 hours now. Same software and hardware setup just replace the new CPU in. Planning to let it run maybe for a little longer just to ensure no BSOD occur. Will update you on the later test. Cheers~!
Complete 24 hours AIDA64 stability stress test without BSOD. Start to install all my application and software excluding the Asus AI Suite III and ASUS RoG forum mod don't recommend using that software.
System been up with all my application and software running without BSOD. Will let this run for few more days just to ensure everything in check. Will keep you posted. Cheers~!
: ) Thanks for the update, let's hope it stays this way.
Regards,
Patrick
Status
Not open for further replies.
You have insufficient privileges to reply here.
Related Threads
?
?
?
?
?
Tech Support Forum
4.7M posts
957.9K members
Since 2002
A forum community dedicated to tech experts and enthusiasts. Come join the discussion about articles, computer security, Mac, Microsoft, Linux, hardware, networking, gaming, reviews, accessories, and more!