site  contact  subhomenews

Kernel 2.6.37 disaster

January 07, 2011 — BarryK
Well, where do I start?... so many problems.

The only change that I made to configuration, relative to when I configured the 2.6.35 kernel, was to disable the Big Kernel Lock (BKL) as this is supposed to be one of the significant improvements of this kernel.

I compiled the kernel with a full HD installation of Wary 5.0. After compiling, rebooted, and got this on the screen:

[Hardware Error]: No human readable MCE decoding support on this CPU type.
[Hardware Error]: Run the message through 'meclog --ascii' to decode.


Well, all previous kernels did not have that problem. The kernel log when booting my 2.6.31.4 kernel:

mce: CPU supports 4 MCE banks

...no error message.

A quick google showed that others have been getting this MCE error with the 2.6.36 kernel, for example:

https://bugzilla.novell.com/show_bug.cgi?id=651947

Then it just hung. No error message, no kernel panic, kernel booting just stopped.

I pressed the Reset button on the PC, and this time it did boot. But, various mysterious things, like:

1. The USB drivers had loaded, but mouse simply did not work. No error messages, all messages look fine, including when Xorg starts, just that the mouse pointer doesn't move.

2. The core ALSA driver loaded, but the hardware-specific driver did not load. Again, no hint why not.

Oh dear, what have they done to the kernel!!!!

Comments

Kernel MCE code
Username: BarryK
Well, it looks like this guy has rewritten the MCE (Machine Check Exception) handling code in the kernel: http://www.webservertalk.com/message2320089.html Note, my PC has a very plain-vanilla Intel Celeron CPU. It is an ASUSTek motherboard, manufactured in 2006, with VIA support chips. This PC has been one of my workhorses, and has always worked fine with previous puppies. As to the rest of the problems, I have no idea...

Recompile, I'm told
Username: gcmartin
"Barry, I'm certainly NO GURU on kernels BUT, my specialist tells me to tell you to recompile: leave in BKL and add LVM. Says you "should be good to go". Hope this helps

2.6.36.2
Username: Iguleder
"For some reason 2.6.36.2 seems very good here, on a 2007 dual-core PC and my 2010 netbook. I'm recompiling it for spup, it's patched with BFS and other goodies to make it uber-fast. It seems modem drivers don't like this one ... and ndiswrapper needs to be patched to work with it. I just bought www.iguleder.info, so if you want my kernel sources, they should be available in iguleder.info/sources once I'm done recompiling it :)

Third-party repos
Username: Iguleder
"Barry, Woof does not support third-party repos: their package lists are not copied to /root/.packages. Line 1026 in 3builddistro: [code]cp -f ../Packages-puppy-[0-9]-official rootfs-complete/root/.packages/ cp -f ../Packages-puppy-woof-official rootfs-complete/root/.packages/[/code] Why don't you change to this? [code]cp -f ../Packages-*-official rootfs-complete/root/.packages/[/code] You could also write a loop that copies just the lists for the repos available in PPM to save some space. I wrote a browser installer that automatically finds the most appropriate PET according to the repos in DISTRO_PET_REPOS and it does not detect the packages under my repo because the package list is missing. That's how I found this so-called bug.

re ndiswrapper
Username: BarryK
"Iguleder, Yeah, that package database copying code in Woof is messy, I need to rewrite it. Patch for ndiswrapper? Do you know where I can get that patch?

A Patch
Username: Hagur
"Take a Look on Last Question on http://mcelog.org/faq.html "I get "kernel hardware error no human readable mce decoding support on this cpu type" they have a Patch for this bug.


Tags: puppy