DDS4 DAT Tape Drive Failure?
Post DDS4 DAT Tape Drive Failure? 

Hi All

I have been happily doing nightly full backups on one of my Linux Servers for a few months now. However, since the beginning of this week I have been getting the following errors. Has my Tape Drive failed? Has a cable got broken? Could somebody give me a How-To in order to Troubleshoot my tape drive please?

My failed log is below:-

07-Mar 21:30 Server-dir: RunBefore: Connecting to Director localhost:9101
07-Mar 21:30 Server-dir: RunBefore: 1000 OK: Server-dir Version: 1.36.2 (28 February 2005)
07-Mar 21:30 Server-dir: RunBefore: Enter a period to cancel a command.
07-Mar 21:30 Server-dir: RunBefore: mount storage=DDS4
07-Mar 21:30 Server-dir: RunBefore: Using default Catalog name=MyCatalog DB=bacula
07-Mar 21:32 Server-dir: RunBefore: 3901 open device failed: ERR=dev.c:289 stored: unable to open device /dev/nst0: ERR=Input/output error
07-Mar 21:32 Server-dir: RunBefore:
07-Mar 21:32 Server-dir: RunBefore: You have messages.
07-Mar 21:32 Server-dir: RunBefore: exit
07-Mar 21:32 Server-dir: Start Backup JobId 900, Job=HaverhillJob.2007-03-07_21.30.00
07-Mar 21:34 Server-fd: HaverhillJob.2007-03-07_21.30.00 Fatal error: job.c:1665 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data
07-Mar 21:34 Server-sd: HaverhillJob.2007-03-07_21.30.00 Fatal error: device.c:317 Unable to open device /dev/nst0. ERR=dev.c:289 stored: unable to open device /dev/nst0: ERR=Input/output error
07-Mar 21:34 Server-dir: HaverhillJob.2007-03-07_21.30.00 Error: Bacula 1.36.2 (28Feb05): 07-Mar-2007 21:34:09
JobId: 900
Job: HaverhillJob.2007-03-07_21.30.00
Backup Level: Full
Client: Server-fd
FileSet: "HaverhillFiles" 2006-11-18 00:08:52
Pool: "HaverhillPool"
Storage: "DDS4"
Start time: 07-Mar-2007 21:30:02
End time: 07-Mar-2007 21:34:09
FD Files Written: 0
SD Files Written: 0
FD Bytes Written: 0
SD Bytes Written: 0
Rate: 0.0 KB/s
Software Compression: None
Volume name(s):
Volume Session Id: 1
Volume Session Time: 1173289086
Last Volume Bytes: 1
Non-fatal FD errors: 0
SD Errors: 0
FD termination status: Error
SD termination status: Error
Termination: *** Backup Error ***

Here is part of my "dmesg" with my Tape Drive detected:-

scsi2 : Initio INI-A100U2W SCSI device driver; Revision: 1.02d
Vendor: HP Model: C5683A Rev: C111
Type: Sequential-Access ANSI SCSI revision: 02
st: Version 20040403, fixed bufsize 32768, s/g segs 256
Attached scsi tape st0 at scsi2, channel 0, id 3, lun 0
st0: try direct i/o: yes (alignment 512 B), max page reachable by HBA 1048575

Note that mt commands (such as mt -f /dev/nst0 rewind or mt -f /dev/st0 rewind) also give me input/output errors, when I turn Bacula off.

Many Thanks
Alan To

Bernhard and Company Ltd
Bilton Road * Rugby * England * CV22 7DT
Tel +44 1788 811600 * Fax +44 1788 812640
Toll Free (USA) 1-888 GRIND IT
Website: http://www.bernhard.co.uk

Post DDS4 DAT Tape Drive Failure? 

Hi

My SCSI card is an Adaptec ASC-19160, and I have a HP Surestore DAT 40i (C5686A). Both items are over four years old (Oct 2002) so maybe either/both items do need replacing. It's a last resort thing to do though. Are there any troubleshooting tips first?

-- Regards
Alan To

From: Xeos Laenor [mailto:xeoslaenor < at > gm...]
Sent: 08 March 2007 11:10
To: Alan To
Subject: Re: [Bacula-users] DDS4 DAT Tape Drive Failure?

Hi,
I have had the same error.
What is your scsi adapter? Adaptec?

Franck


Post DDS4 DAT Tape Drive Failure? 

07-Mar 21:32 Server-dir: RunBefore: 3901 open device failed: ERR=dev.c:289 stored: unable to open device /dev/nst0: ERR=Input/output error
07-Mar 21:32 Server-dir: RunBefore:
07-Mar 21:32 Server-dir: RunBefore: You have messages.
07-Mar 21:32 Server-dir: RunBefore: exit
07-Mar 21:32 Server-dir: Start Backup JobId 900, Job=HaverhillJob.2007-03-07_21.30.00
07-Mar 21:34 Server-fd: HaverhillJob.2007-03-07_21.30.00 Fatal error: job.c:1665 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data
07-Mar 21:34 Server-sd: HaverhillJob.2007-03-07_21.30.00 Fatal error: device.c:317 Unable to open device /dev/nst0. ERR=dev.c:289 stored: unable to open device /dev/nst0: ERR=Input/output error

Here is part of my "dmesg" with my Tape Drive detected:-

scsi2 : Initio INI-A100U2W SCSI device driver; Revision: 1.02d
Vendor: HP Model: C5683A Rev: C111
Type: Sequential-Access ANSI SCSI revision: 02
st: Version 20040403, fixed bufsize 32768, s/g segs 256
Attached scsi tape st0 at scsi2, channel 0, id 3, lun 0
st0: try direct i/o: yes (alignment 512 B), max page reachable by HBA 1048575

Note that mt commands (such as mt -f /dev/nst0 rewind or mt -f /dev/st0 rewind) also give me input/output errors, when I turn Bacula off.

Hi Alan,

Sounds like it's either dead or just unplugged. Done any moving around
lately? What do you have in /proc/scsi/scsi? (# cat /proc/scsi/scsi)

It should list (depending upon your setup) the tape drive and maybe some
SCSI/SATA/USB hard disks too, in fact anything that uses the generic
SCSI kernel module.

If your tape drive isn't listed then the kernel knows nothing of it and
it just isn't going to work (which is why mt and Bacula fail). If it
is listed, then it could be something more difficult to pin down.

If it isn't listed, then you'll need to check the usual things (powered
on or not, everything plugged in correctly, SCSI termination, etc.).
Is there anything else plugged into the HBA that could have died?
Could the HBA or the drive be faulty? Just try the usual checks: move
the drive to another HBA to see whether the drive is OK or the HBA is
dead, and check that it hasn't been inadvertently powered off (cable
unplugged, tripped over cable, etc.)...

And so on and so on...

HTH

Dan

--

Dan Hawker
Linux System Administrator
Astrium
http://www.astrium.eads.net


Post DDS4 DAT Tape Drive Failure? 

Hi Dan, Hi All

It is slightly difficult to check [movement] as the server is on a separate site to where I am based. Anyway I have just catalogued my scsi devices and yes, it is there/detected:
# cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: ADAPTEC Model: AAR-2410SA RAID5 Rev: V1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 00 Lun: 00
Vendor: ATA Model: WDC WD400JD-00LS Rev: 06.0
Type: Direct-Access ANSI SCSI revision: 05
Host: scsi2 Channel: 00 Id: 03 Lun: 00
Vendor: HP Model: C5683A Rev: C111
Type: Sequential-Access ANSI SCSI revision: 02

If it wasn't detected then I would not be able to do the mt command (I would get the error "No such device or address" instead) e.g. (I don't have a second tape drive so nst1 has nothing there):
# mt -f /dev/nst0 rewind
/dev/nst0: Input/output error
# mt -f /dev/nst1 rewind
/dev/nst1: No such device or address
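
The two checks discussed so far (is a tape drive listed by the kernel, and which of the two mt errors comes back) can be sketched as small shell helpers. This is only an illustration: the function names tape_listed and explain_mt_error and the hint text they print are made up here; only grep, /proc/scsi/scsi and the two error strings come from the thread.

```shell
#!/bin/sh
# Sketch only: function names and hint wording are illustrative assumptions,
# not part of Bacula or mt.

# tape_listed FILE
# Succeeds if FILE (in /proc/scsi/scsi format) contains a Sequential-Access
# entry, i.e. the kernel has registered a tape drive.
tape_listed() {
    grep -q 'Type:[[:space:]]*Sequential-Access' "$1"
}

# explain_mt_error MESSAGE
# Maps the two mt error messages quoted in this thread to a rough reading.
explain_mt_error() {
    case "$1" in
        *"No such device or address"*)
            echo "not-detected: nothing is attached behind that device node" ;;
        *"Input/output error"*)
            echo "detected-but-failing: drive is registered, command failed (media, cabling, termination or drive)" ;;
        *)
            echo "unknown: message not covered by this sketch" ;;
    esac
}

# Example use on the affected server (left commented out so that loading
# this file has no side effects):
#   tape_listed /proc/scsi/scsi && echo "kernel lists a tape drive"
#   explain_mt_error "$(mt -f /dev/nst0 rewind 2>&1)"
```

A listed drive only shows that the kernel saw it at some point, so a positive result from tape_listed does not rule out a cable or power problem that appeared later.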

What can I troubleshoot next, before purchasing new hardware or going to the site?

-- Many Thanks
Alan To


Post DDS4 DAT Tape Drive Failure? 
On Thu, 8 Mar 2007 12:06:45 +0000, Alan To said:

Hi Dan, Hi All

It is slightly difficult to check [movement] as the server is on
a separate site to where I am based. Anyway I have just catalogued my scsi
devices and yes, it is there/detected:
# cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: ADAPTEC Model: AAR-2410SA RAID5 Rev: V1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 00 Lun: 00
Vendor: ATA Model: WDC WD400JD-00LS Rev: 06.0
Type: Direct-Access ANSI SCSI revision: 05
Host: scsi2 Channel: 00 Id: 03 Lun: 00
Vendor: HP Model: C5683A Rev: C111
Type: Sequential-Access ANSI SCSI revision: 02

If it wasn't detected then I would not be able to do the mt command (I would
get the error "No such device or address" instead) e.g. (I don't have a
second tape drive so nst1 has nothing there):
# mt -f /dev/nst0 rewind
/dev/nst0: Input/output error
# mt -f /dev/nst1 rewind
/dev/nst1: No such device or address

What can I troubleshoot next, before purchasing new hardware or going to the site?

Is that /proc/scsi/scsi output from after a reboot? If not, it may be stale
info and the device may no longer be connected. Are there any syslog/console
messages when you run mt rewind?

__Martin
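
One way to collect the kernel messages asked about here is to filter the kernel log right after the failing command. The following is a minimal sketch, assuming the device names seen in this thread (st0/nst0 on host scsi2); the helper name tape_kernel_lines and its match pattern are hypothetical and would need adjusting on another system.

```shell
#!/bin/sh
# Sketch only: the function name and the pattern are assumptions chosen for
# this thread's device names (st0/nst0 on host scsi2).

# tape_kernel_lines
# Reads kernel log text on stdin and prints only the lines that mention the
# tape device or its SCSI host as a whole word. Prints nothing (and still
# returns success) when no line matches.
tape_kernel_lines() {
    grep -E '(^|[^[:alnum:]])(n?st0|scsi2)([^[:alnum:]]|$)' || true
}

# Example use on the affected server (left commented out so that loading
# this file has no side effects):
#   mt -f /dev/nst0 rewind
#   dmesg | tail -n 50 | tape_kernel_lines
```

Running the filter immediately after mt keeps the output short enough to paste into a reply; the same filter can be pointed at a syslog file instead of dmesg.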
