Oddbean new post about | logout
 You are probably experiencing memory corruption or something similar without noticing it.

The only times I've ever had a leveldb database get corrupted were due to bad memory. I spent a fair bit of ₿ upgrading my desktop to ECC memory a few years ago due to a run-in with bad memory that corrupted files. 
 I’ve had many bad memory issues in the past and it usually always leads to system instability. If it is memory it must be a very minor issue that somehow doesn’t cause anything else to crash. 
 It's very easy for minor memory issues to result in disk corruption rather than overall system instability. You just need something like a single bad bit that only shows up sometimes, eg while hot as nostr:nprofile1qqsr6tj32zrfn7v0pu4aheaytdnnc6rluepq73ndc2tdjzus34gat9qpz4mhxue69uhhyetvv9ujuerpd46hxtnfduhswulwwv pointed out. 
 Hmm will run more tests to confirm 
 Write a memory test program that stores 4092 bytes and a CRC 
 memtest86 does this more thoroughly… ive always used that in the past 
 You want real workloads 
 Yeah but its harder to test every physical address, isn’t that the point 
 Due to interleaving and similar you already mostly achieve that 
 Memtest doesn’t tend to get your CPU hot, though. Different things can fail at different utilization levels… 
 There was that prime-something program i remember using a long time ago for that, not sure if there are more modern solutions 
 ycruncher? 
 I remember prime95 but maybe that was like 20 years ago 
 damn im old 
 whatever type of error you are having must be a more widespread issue than a few pages to be triggered so often

the ideal program would have a small pool it nonstop allocates to and deallocates from and a pool it very slowly checks and rotates allocations in/out

write random data and CRC it as I said

you then want to stop the process and get a debugger if there’s a mismatch and see the physical location along with identifying the RAM module 
 Prime95 will definitely get a CPU toasty, but you would need to ensure it's running with a mix of large and small FFT and for a while (e.g. hours/days). Small FFT maximizes CPU heat, but isn't the best for finding instabilities working with RAM. Large FFT helps there. In general, I find the overclocking community sometimes works on "vibes" rather than completely proven test methodologies, so take this with a grain of salt. 
 not that you're overclocking, but that group tends to accumulate tribal knowledge of CPU/RAM stability tests. Hardware can be "fun". A ton of variables. Even things like bios versions can cause instabilities.