erlug
[Top] [All Lists]

Re: [Erlug] e1000 watchdog

To: ERlug - Lista Pubblica <erlug@xxxxxxxxxxxxxx>
Subject: Re: [Erlug] e1000 watchdog
From: Daniele Palumbo <daniele@xxxxxxxxxxxx>
Date: Wed, 26 Oct 2005 23:36:06 +0200
On Wednesday 26 October 2005 01:37, Davide Bolcioni wrote:
> La e1000 mi mette dei brutti ricordi:
>
>    https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=149887
>
> anche se il problema mi sembra diverso.

hai poi risolto cambiando scheda?

comunque, ho scaricato i driver intel 6.1.16 (o quel cazzo che è):
---
Oct 26 21:50:42 sulu kernel: e1000: eth1: e1000_clean_tx_irq: Detected Tx Unit 
Hang
Oct 26 21:50:42 sulu kernel:   TDH                  <e1>
Oct 26 21:50:42 sulu kernel:   TDT                  <e1>
Oct 26 21:50:42 sulu kernel:   next_to_use          <e1>
Oct 26 21:50:42 sulu kernel:   next_to_clean        <cf>
Oct 26 21:50:42 sulu kernel: buffer_info[next_to_clean]
Oct 26 21:50:42 sulu kernel:   dma                  <98789e>
Oct 26 21:50:42 sulu kernel:   time_stamp           <92309>
Oct 26 21:50:42 sulu kernel:   next_to_watch        <cf>
Oct 26 21:50:42 sulu kernel:   jiffies              <923fc>
Oct 26 21:50:42 sulu kernel:   next_to_watch.status <0>
Oct 26 21:50:44 sulu kernel: e1000: eth1: e1000_clean_tx_irq: Detected Tx Unit 
Hang
Oct 26 21:50:44 sulu kernel:   TDH                  <e1>
Oct 26 21:50:44 sulu kernel:   TDT                  <e1>
Oct 26 21:50:44 sulu kernel:   next_to_use          <e1>
Oct 26 21:50:44 sulu kernel:   next_to_clean        <cf>
Oct 26 21:50:44 sulu kernel: buffer_info[next_to_clean]
Oct 26 21:50:44 sulu kernel:   dma                  <98789e>
Oct 26 21:50:44 sulu kernel:   time_stamp           <92309>
Oct 26 21:50:44 sulu kernel:   next_to_watch        <cf>
Oct 26 21:50:44 sulu kernel:   jiffies              <924c4>
Oct 26 21:50:44 sulu kernel:   next_to_watch.status <0>
Oct 26 21:50:46 sulu kernel: e1000: eth1: e1000_clean_tx_irq: Detected Tx Unit 
Hang
Oct 26 21:50:46 sulu kernel:   TDH                  <e1>
Oct 26 21:50:46 sulu kernel:   TDT                  <e1>
Oct 26 21:50:46 sulu kernel:   next_to_use          <e1>
Oct 26 21:50:46 sulu kernel:   next_to_clean        <cf>
Oct 26 21:50:46 sulu kernel: buffer_info[next_to_clean]
Oct 26 21:50:46 sulu kernel:   dma                  <98789e>
Oct 26 21:50:46 sulu kernel:   time_stamp           <92309>
Oct 26 21:50:46 sulu kernel:   next_to_watch        <cf>
Oct 26 21:50:46 sulu kernel:   jiffies              <9258c>
Oct 26 21:50:46 sulu kernel:   next_to_watch.status <0>
Oct 26 21:50:48 sulu kernel: e1000: eth1: e1000_clean_tx_irq: Detected Tx Unit 
Hang
Oct 26 21:50:48 sulu kernel:   TDH                  <e1>
Oct 26 21:50:48 sulu kernel:   TDT                  <e1>
Oct 26 21:50:48 sulu kernel:   next_to_use          <e1>
Oct 26 21:50:48 sulu kernel:   next_to_clean        <cf>
Oct 26 21:50:48 sulu kernel: buffer_info[next_to_clean]
Oct 26 21:50:48 sulu kernel:   dma                  <98789e>
Oct 26 21:50:48 sulu kernel:   time_stamp           <92309>
Oct 26 21:50:48 sulu kernel:   next_to_watch        <cf>
Oct 26 21:50:48 sulu kernel:   jiffies              <92654>
Oct 26 21:50:48 sulu kernel:   next_to_watch.status <0>
Oct 26 21:50:50 sulu kernel: e1000: eth1: e1000_clean_tx_irq: Detected Tx Unit 
Hang
Oct 26 21:50:50 sulu kernel:   TDH                  <e1>
Oct 26 21:50:50 sulu kernel:   TDT                  <e1>
Oct 26 21:50:50 sulu kernel:   next_to_use          <e1>
Oct 26 21:50:50 sulu kernel:   next_to_clean        <cf>
Oct 26 21:50:50 sulu kernel: buffer_info[next_to_clean]
Oct 26 21:50:50 sulu kernel:   dma                  <98789e>
Oct 26 21:50:50 sulu kernel:   time_stamp           <92309>
Oct 26 21:50:50 sulu kernel:   next_to_watch        <cf>
Oct 26 21:50:50 sulu kernel:   jiffies              <9271c>
Oct 26 21:50:50 sulu kernel:   next_to_watch.status <0>
Oct 26 21:50:51 sulu kernel: NETDEV WATCHDOG: eth1: transmit timed out
Oct 26 21:50:54 sulu kernel: e1000: eth1: e1000_watchdog: NIC Link is Up 1000 
Mbps Full Duplex
Oct 26 21:50:56 sulu kernel: e1000: eth1: e1000_watchdog: NIC Link is Down
Oct 26 21:50:59 sulu kernel: e1000: eth1: e1000_watchdog: NIC Link is Up 1000 
Mbps Full Duplex
---

attuale /proc/interrupts
---
root@sulu:/usr/src/e1000-6.1.16/src# cat /proc/interrupts
           CPU0
  0:    1253358          XT-PIC  timer
  1:          2          XT-PIC  keyboard
  2:          0          XT-PIC  cascade
  7:         65          XT-PIC  usb-uhci, usb-uhci
  8:          1          XT-PIC  rtc
  9:          0          XT-PIC  acpi
 10:      69455          XT-PIC  ehci_hcd, eth0
 11:     979972          XT-PIC  libata, usb-uhci, usb-uhci, eth1
 14:      55308          XT-PIC  ide0
 15:          5          XT-PIC  ide1
NMI:          0
ERR:          0
---

ora, è un problema di irq, a quanto posso capire.
ho provato a cambiare tutti gli slot pci, abilitare e disabilitare l'apic, ...
mi manca solo da dire a linux di usare le impostazioni pci del bios.

esperienze simili:
http://lkml.org/lkml/2005/8/15/75 (e thread)
http://groups.google.it/group/novell.support.suse.linux.enterprise-server/browse_thread/thread/e011d19a4080fde/25bd804fba57c6fc?lnk=st&q=%22e1000_clean_tx_irq:
+Detected+Tx+Unit+Hang%22&rnum=1&hl=it#25bd804fba57c6fc


in allegato l'output di "vmstat 1".
l'ultima riga che ricevo, prima del timeout, è
---
procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu----
1  0      0   4824  61040 263408    0    0  8712   112 4946  3464  0 20 80  0
---


bye
d.
-- 
PGP Key-ID: 0xF482D454
--
to boldly go where no man has gone before.

Attachment: vmstat
Description: Text document

Attachment: pgpw9IdR1f2VR.pgp
Description: PGP signature

<Prev in Thread] Current Thread [Next in Thread>