Linux - HardwareThis forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Thats a single sample from dmesg, but they don't usually fail at the same time. Usually I find I have to reset one, and then a few hours (of use) later I have to reset the other.
I've spent several hours looking these up and getting nowhere, I'd appreciate some help. Oh, and I've also tried booting with and without ACPI and APIC (I've had issues with APIC and ACPI on slightly older hardware).
that it varies could make it real tricky....I am suspecting hw detection irq conflicts.....and pls do not insert any other devices to keep it simple....some hw could also be failing...lets hope not
1) do a number of full reboots...no suspend to ram etc pls
on each reboot
Code:
su
cat /proc/interrupts > /irqN.txt...where n is 1 then 2 etc
lsmod > /modulesN.txt
after say 4 reboots see if there are any differences and post what they are or that there are none.
the modulesfile is harder to check by eye so you may want to run a diff command or I prefer a gui xxdiff.
2) to eliminate hw questions can you confirm...you have not made any hw changes recently ....and assuming you know about static electricity prevention...pls push down on all pci cards to make sure they are still properly seated pls
3) can you confirm if these devices worked correctly under a different operating system or distro pls
that it varies could make it real tricky....I am suspecting hw detection irq conflicts.....and pls do not insert any other devices to keep it simple....some hw could also be failing...lets hope not
1) do a number of full reboots...no suspend to ram etc pls
on each reboot
Code:
su
cat /proc/interrupts > /irqN.txt...where n is 1 then 2 etc
lsmod > /modulesN.txt
after say 4 reboots see if there are any differences and post what they are or that there are none.
the modulesfile is harder to check by eye so you may want to run a diff command or I prefer a gui xxdiff.
2) to eliminate hw questions can you confirm...you have not made any hw changes recently ....and assuming you know about static electricity prevention...pls push down on all pci cards to make sure they are still properly seated pls
3) can you confirm if these devices worked correctly under a different operating system or distro pls
1. I'll get back to you on this (its currently working, and Its being used to watch TV )
2+3. This is the fun bit, all the hardware has changed, the box used to be my main desktop, but I bought a new one 6 months ago, and this machine has sat a little. I've since installed a GbE nic, which is new, and the TV card which worked fine the last time I used it.
I have noticed that my server, with another identical GbE card has similar pci errors when under SUPER HIGH load. But it hasn't happened in a while. It makes me thing this particular error stems from bad drivers? But nothing explains the tv card errors so far.
1. I'll get back to you on this (its currently working, and Its being used to watch TV )
All done now, they are all the same, at least with ACPI and APIC off, there will be no dynamic mapping of IRQs, and it seems I got lucky with slot placement, since no major devices seem to be sharing IRQs.
In case it might help, heres a bunch of current hopefully relevant information:
thinking out loud...I wonder if you need a module preload for the ethernet before the tv card?...I would like to know if your dmesg shows the current working sequence...whatever it is.....and then when you have a fail...keep a copy of that and we might see the hw is detected out-of-sequence...leading to a module load fail?
of course with tricky hw, it may the one we do not see....that is the issue...so its nice to see your working dmesg.
thinking out loud...I wonder if you need a module preload for the ethernet before the tv card?...I would like to know if your dmesg shows the current working sequence...whatever it is.....and then when you have a fail...keep a copy of that and we might see the hw is detected out-of-sequence...leading to a module load fail?
of course with tricky hw, it may the one we do not see....that is the issue...so its nice to see your working dmesg.
Well, when I said "currently working", it works for a while, then I get an error some time later, usually hours later.
yeah thanks for that...and for line numbers I use F11 with my text editor to get them.
a quick look has these lines of interest
lines 339 & 340 have i2c properties not installing....its possible if you did a vanilla kernel and enabled full i2c support for this card...you MAY remove these errors.
2) lines 354 357 appear to be the source of your hw issue.
3) the last time I checked Gentoo is supposed to get you to compile your own kernel, have you done so?
it might just be an easy step to read your linux documentation for drivers and enable more in or allow more modules....then do your gentoo modules etc
but I do not use Gentoo so can not help with those steps.
yeah thanks for that...and for line numbers I use F11 with my text editor to get them.
a quick look has these lines of interest
lines 339 & 340 have i2c properties not installing....its possible if you did a vanilla kernel and enabled full i2c support for this card...you MAY remove these errors.
2) lines 354 357 appear to be the source of your hw issue.
3) the last time I checked Gentoo is supposed to get you to compile your own kernel, have you done so?
it might just be an easy step to read your linux documentation for drivers and enable more in or allow more modules....then do your gentoo modules etc
but I do not use Gentoo so can not help with those steps.
This is a debian sid/unstable box.
edit: also, the bt878* modules are from alsa to handle the audio. Which, odly enough, works just fine, even with the odd exit error. Theres two separate devices on the TV card, the Video Capture Device, and the Audio Capture Device, both of which seem to work, untill they stop working...
can you see any pattern to what you were doing at the same time as when either of those devices fails?
I am now thinking, maybe its a sound server issue, I do not use Debian either, but are you using KDE by any chance? In which case we can fix some sound server issues thru the control center
can you see any pattern to what you were doing at the same time as when either of those devices fails?
I am now thinking, maybe its a sound server issue, I do not use Debian either, but are you using KDE by any chance? In which case we can fix some sound server issues thru the control center
It is running KDE, but the onboard sound supports multiple hardware streams, so artsd can have its own dedicated channel while mythtv gets one as well. And its not the onboard sound that's having the problem, its the capture device on the WinTV card.
fair enough...does it fail after running after a certain amount of time?
I know you have already attempted to eliminate power saving issues but I am running out of ideas.
does it happen in synch with your crontab jobs?
/etc/crontab....first 4 numbers are mm hh
I'll have to wait and see. I've made some changes, removed half my ram (didnt need to, but eh), and swapped out the Radeon 9600xt for an older 9200 that I feel more confident about (the 9600xt had its fan replaced after it failed, its possible the 9600xt might have been damaged in some way, even though it "seems" to work).
I've been watching some tv on it for a few hours now, with no pci errors. But that's not saying much, the errors pop up semi randomly.
I'll have to wait and see. I've made some changes, removed half my ram (didnt need to, but eh), and swapped out the Radeon 9600xt for an older 9200 that I feel more confident about (the 9600xt had its fan replaced after it failed, its possible the 9600xt might have been damaged in some way, even though it "seems" to work).
I've been watching some tv on it for a few hours now, with no pci errors. But that's not saying much, the errors pop up semi randomly.
Still getting these darned errors. Its really annoying.
As for cron jobs, no, it doesn't seem to be occurring with cron jobs.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.