Rif: Re: Change tape problem
Post Rif: Re: Change tape problem 
<br><font size=2 face="sans-serif">Hi Arno,</font>
<br><font size=2 face="sans-serif">Thank you very much for your answer.
I will run the tests you suggest as soon as possible. In the meantime, I purged the
volumes involved in the error shown in the original message (that was the
third attempt) and restarted the backup job; here is the (correct) result.</font>
<br>
<br><font size=2><tt>25-feb 19:55 bacula-dir: Start Backup JobId 12927,
Job=webfs3-job.2007-02-25_19.55.40<br>
25-feb 19:55 bacula-dir: Recycled volume "web-004"<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: run command "/root/restartsmb"<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: Shutting down SMB services: [
OK ]<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: Starting SMB services: [ OK
]<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: [ OK ]<br>
25-feb 19:55 bacula-sd: 3307 Issuing autochanger "unload slot 7, drive
0" command.<br>
25-feb 19:57 bacula-sd: 3304 Issuing autochanger "load slot 3, drive
0" command.<br>
25-feb 19:57 bacula-sd: 3305 Autochanger "load slot 3, drive 0",
status is OK.<br>
25-feb 19:57 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
25-feb 19:57 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 3.<br>
25-feb 19:57 bacula-sd: Recycled volume "web-004" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
webfs3: /proc is a different filesystem. Will not descend
from / into /proc<br>
webfs3: /boot is a different filesystem. Will not descend
from / into /boot<br>
webfs3: /dev is a different filesystem. Will not descend
from / into /dev<br>
webfs3: /var/lib/nfs/rpc_pipefs is a different filesystem.
Will not descend from / into /var/lib/nfs/rpc_pipefs<br>
webfs3: /sys is a different filesystem. Will not descend
from / into /sys<br>
webfs3: /uno is a different filesystem. Will not descend
from / into /uno<br>
26-feb 04:14 bacula-sd: End of Volume "web-004" at 594:6519 on
device "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
26-feb 04:14 bacula-sd: Re-read of last block succeeded.<br>
26-feb 04:14 bacula-sd: End of medium on Volume "web-004" Bytes=594,382,602,240
Blocks=9,213,519 at 26-feb-2007 04:14.<br>
26-feb 04:14 bacula-dir: Recycled volume "web-005"<br>
26-feb 04:14 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 04:14 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 3.<br>
26-feb 04:14 bacula-sd: 3307 Issuing autochanger "unload slot 3, drive
0" command.<br>
26-feb 04:15 bacula-sd: 3304 Issuing autochanger "load slot 4, drive
0" command.<br>
26-feb 04:15 bacula-sd: 3305 Autochanger "load slot 4, drive 0",
status is OK.<br>
26-feb 04:15 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 04:15 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 4.<br>
26-feb 04:15 bacula-sd: Recycled volume "web-005" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
26-feb 04:15 bacula-sd: New volume "web-005" mounted on device
"LTO1" (/dev/lto1) at 26-feb-2007 04:15.<br>
26-feb 10:21 bacula-sd: End of Volume "web-005" at 528:6656 on
device "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
26-feb 10:21 bacula-sd: Re-read of last block succeeded.<br>
26-feb 10:21 bacula-sd: End of medium on Volume "web-005" Bytes=528,395,664,384
Blocks=8,190,656 at 26-feb-2007 10:21.<br>
26-feb 10:21 bacula-dir: Recycled volume "web-006"<br>
26-feb 10:21 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 10:21 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 4.<br>
26-feb 10:21 bacula-sd: 3307 Issuing autochanger "unload slot 4, drive
0" command.<br>
26-feb 10:22 bacula-sd: 3304 Issuing autochanger "load slot 5, drive
0" command.<br>
26-feb 10:22 bacula-sd: 3305 Autochanger "load slot 5, drive 0",
status is OK.<br>
26-feb 10:22 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 10:22 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 5.<br>
26-feb 10:23 bacula-sd: Recycled volume "web-006" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
26-feb 10:23 bacula-sd: New volume "web-006" mounted on device
"LTO1" (/dev/lto1) at 26-feb-2007 10:23.<br>
26-feb 13:49 bacula-sd: Job write elapsed time = 17:48:45, Transfer rate
= 21.65 M bytes/second<br>
26-feb 13:49 bacula-sd: Alert: SCSI 2 tape drive:<br>
26-feb 13:49 bacula-sd: Alert: File number=267, block number=0, partition=0.<br>
26-feb 13:49 bacula-sd: Alert: Tape block size 0 bytes. Density code 0x44
(no translation).<br>
26-feb 13:49 bacula-sd: Alert: Soft error count since last status=0<br>
26-feb 13:49 bacula-sd: Alert: General status bits on (81010000):<br>
26-feb 13:49 bacula-sd: Alert: EOF ONLINE IM_REP_EN<br>
26-feb 13:49 bacula-dir: Bacula 2.0.2 (28Jan07): 26-feb-2007 13:49:03<br>
JobId: 12927<br>
Job:
webfs3-job.2007-02-25_19.55.40<br>
Backup Level: Full<br>
Client:
"webfs3" 2.0.2 (28Jan07) i686-redhat-linux-gnu,redhat,Enterprise
release<br>
FileSet: "webfs3-fileset"
2005-04-30 07:13:53<br>
Pool:
"webfs" (From Job resource)<br>
Storage: "LTO-1"
(From user selection)<br>
Scheduled time: 25-feb-2007 19:55:17<br>
Start time: 25-feb-2007
19:55:46<br>
End time: 26-feb-2007
13:49:03<br>
Elapsed time: 17 hours 53 mins
17 secs<br>
Priority: 10<br>
FD Files Written: 4,046,880<br>
SD Files Written: 4,046,880<br>
FD Bytes Written: 1,387,910,783,372 (1.387
TB)<br>
SD Bytes Written: 1,388,589,182,436 (1.388
TB)<br>
Rate:
21552.4 KB/s<br>
Software Compression: None<br>
VSS:
no<br>
Encryption: no<br>
Volume name(s): web-004|web-005|web-006<br>
Volume Session Id: 1<br>
Volume Session Time: 1172427565<br>
Last Volume Bytes: 266,951,559,168 (266.9 GB)<br>
Non-fatal FD errors: 0<br>
SD Errors: 0<br>
FD termination status: OK<br>
SD termination status: OK<br>
Termination: Backup OK<br>
<br>
</tt></font>
<br><font size=2 face="sans-serif">What argues against a
hardware or OS problem is that, with the same hardware and OS, Bacula 1.36.3
did not have this problem; it first appeared with 1.38.11.</font>
<br><font size=2 face="sans-serif">The device setup is quite simple:</font>
<br>
<br>
<br><font size=2><tt>Device {</tt></font>
<br><font size=2><tt> Name = LTO1</tt></font>
<br><font size=2><tt> Media Type = LTO-3</tt></font>
<br><font size=2><tt> Archive Device = /dev/lto1</tt></font>
<br><font size=2><tt> AutomaticMount = yes;
# when device opened, read it</tt></font>
<br><font size=2><tt> AlwaysOpen = no;</tt></font>
<br><font size=2><tt> Autoselect = no</tt></font>
<br><font size=2><tt> RemovableMedia = yes;</tt></font>
<br><font size=2><tt> RandomAccess = no;</tt></font>
<br><font size=2><tt> Changer Command = "/etc/bacula/mtx-changer
%c %o %S %a %d"</tt></font>
<br><font size=2><tt> Changer Device = /dev/chg4</tt></font>
<br><font size=2><tt> Drive Index = 0</tt></font>
<br><font size=2><tt> AutoChanger = yes</tt></font>
<br><font size=2><tt> Alert Command = "sh -c 'mt -f %a status'"</tt></font>
<br><font size=2><tt> Maximum Network Buffer Size = 65536</tt></font>
<br><font size=2><tt>}</tt></font>
<br>
<br><font size=2 face="sans-serif">Devices /dev/lto1 and /dev/chg4 are
symlinks to real devices in order to manage hardware configuration changes.</font>
<br>
<br><font size=2 face="sans-serif">Thanks again</font>
<br>
<br><font size=2 face="sans-serif">--------------------------------------------------------------------------<br>
Ferdinando Pasqualetti<br>
G.T.Dati srl<br>
Tel. 0557310862 - 3356172731 - Fax 055720143<br>
<br>
<br>
</font>
<br>
<br>
<br>
<table width=100%>
<tr valign=top>
<td width=40%><font size=1 face="sans-serif"><b>Arno Lehmann &lt;al < at > it...&gt;</b>
</font>
<br><font size=1 face="sans-serif">Sent by: bacula-users-bounces < at > li...</font>
<p><font size=1 face="sans-serif">26/02/2007 20.33</font>
<td width=59%>
<table width=100%>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">To</font></div>
<td><font size=1 face="sans-serif">bacula-users &lt;bacula-users < at > li...&gt;</font>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">CC</font></div>
<td>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">Subject</font></div>
<td><font size=1 face="sans-serif">Re: [Bacula-users] Change tape problem</font></table>
<br>
<table>
<tr valign=top>
<td>
<td></table>
<br></table>
<br>
<br>
<br><font size=2><tt>Hello,<br>
<br>
On 2/26/2007 10:54 AM, Ferdinando Pasqualetti wrote:<br>
&gt; <br>
&gt; Hi Bacula users,<br>
&gt; Sorry if you get this message twice; I first sent it from an address <br>
&gt; that is not subscribed to the list, so I am sending it again.<br>
&gt; I am facing a problem that first appeared with rev. 1.38.11 (I never saw it <br>
&gt; with 1.36.3). It did not happen every time, but very often. Now that <br>
&gt; I have switched to 2.0.2, the problem is much more frequent.<br>
&gt; The problem is that when a tape is full, Bacula correctly changes <br>
&gt; the tape in the autochanger drive, but immediately afterwards gets this error:<br>
&gt; <br>
&gt; 25-feb 02:47 bacula-sd: End of Volume "web-004" at 594:3362
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 25-feb 02:47 bacula-sd: Re-read of last block succeeded.<br>
&gt; 25-feb 02:47 bacula-sd: End of medium on Volume "web-004"
<br>
&gt; Bytes=594,178,937,856 Blocks=9,210,362 at 25-feb-2007 02:47.<br>
&gt; 25-feb 02:47 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 02:47 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 25-feb 02:47 bacula-sd: 3307 Issuing autochanger "unload slot
3, drive <br>
&gt; 0" command.<br>
&gt; 25-feb 02:48 bacula-sd: 3304 Issuing autochanger "load slot 4,
drive 0" <br>
&gt; command.<br>
&gt; 25-feb 02:48 bacula-sd: 3305 Autochanger "load slot 4, drive
0", status <br>
&gt; is OK.<br>
&gt; 25-feb 02:48 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 02:48 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 25-feb 02:49 bacula-sd: Wrote label to prelabeled Volume "web-005"
on <br>
&gt; device "LTO1" (/dev/lto1)<br>
&gt; 25-feb 02:49 bacula-sd: New volume "web-005" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 25-feb-2007 02:49.<br>
&gt; 25-feb 02:49 bacula-sd: End of Volume "web-005" at 0:1 on
device "LTO1" <br>
&gt; (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 25-feb 02:49 bacula-sd: webfs3-job.2007-02-24_20.03.22 Error: Re-read
of <br>
&gt; last block OK, but block numbers differ. Last block=0 Current block=9210362.<br>
&gt; 25-feb 02:49 bacula-sd: Job write elapsed time = 06:43:26, Transfer
rate <br>
&gt; = 24.52 M bytes/second<br>
&gt; 25-feb 02:49 webfs3: webfs3-job.2007-02-24_20.03.22 Fatal error: <br>
&gt; backup.c:860 Network send error to SD. ERR=Pipe rotta<br>
&gt; 25-feb 02:49 bacula-dir: webfs3-job.2007-02-24_20.03.22 Error: Bacula
<br>
&gt; 2.0.2 (28Jan07): 25-feb-2007 02:49:25<br>
&gt; <br>
&gt; It seems there are two problems: the first (and more important) one <br>
&gt; is that Bacula gets an end of volume on the new tape,<br>
<br>
What Bacula reports as an EOT can be caused by a drive error, too, so <br>
for the time being I assume that the second error is tightly related to
<br>
this one.<br>
<br>
&gt; and the second <br>
&gt; one is the difference in the last block (it appears to be the last
block <br>
&gt; of the previous tape).<br>
<br>
If that's the case, and your description seems quite clear, you might <br>
have found an OS or hardware bug, too.<br>
<br>
This is only guesswork, but it is possible that, after a tape <br>
change, the hardware or the tape driver doesn't update its state <br>
information.<br>
<br>
If that's the case, you could try the following:<br>
- First, have a look at your system log and dmesg output. There might be <br>
errors reported there.<br>
- Second, try to reproduce the problem without using Bacula. Unmount the <br>
tape drive from bconsole. Load a tape (an unused one, or one with write <br>
protection). If you use an empty tape, write some data and some file <br>
marks to it, ending with an EOT mark. dd and mt are tools for that purpose.<br>
Then, use tapeinfo or st to observe the tape status, especially the block <br>
position reported, while doing some rewinds, fast forwards, and offline <br>
operations, and see what happens after you use mtx to unload and reload that tape.<br>
<br>
If there really is a problem with the hardware or the OS driver, you <br>
should be able to reproduce it then. Updating the drive firmware and the <br>
OS (or, if that's up to date, filing a bug report) would be two options <br>
then.<br>
<br>
Otherwise, you should run btape again, because there are some things in <br>
the report I don't like - errors writing the last block to tape should <br>
not happen with current hardware, for example. You might try to tune <br>
your device configuration, and perhaps you'll have to set the tape <br>
driver to a different write mode. Suggesting something is difficult <br>
without seeing how it's set up now :-)<br>
<br>
&gt; Bacula is a MySQL version on RedHat AS 4.04, built as an RPM on that system, <br>
&gt; an HP ProLiant G3, 3.2 GHz, 2 GB RAM.<br>
&gt; The tape library is an MSL6000 with two LTO-3 drives, driven by Bacula directly <br>
&gt; (not using the autochanger as a device - the 1.36.3 setup).<br>
&gt; btape tests run correctly, including the "fill and change tape" test (I am <br>
&gt; attaching the test result, in case someone is interested).<br>
&gt; Has anyone had a similar problem?<br>
<br>
That basic setup should run ok I think... nothing unusual there.<br>
<br>
Arno<br>
<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; _______________________________________________<br>
&gt; Bacula-users mailing list<br>
&gt; Bacula-users < at > li...<br>
&gt; https://lists.sourceforge.net/lists/listinfo/bacula-users<br>
<br>
-- <br>
IT-Service Lehmann
al < at > it...<br>
Arno Lehmann http://www.its-lehmann.de<br>
<br>
</tt></font>
<br>
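The manual test Arno describes in the message above (write data and a file mark, check the reported position, then unload and reload the tape with mtx) could be scripted roughly as follows. This is only a dry-run sketch: it builds and prints the commands and executes nothing. The device paths are the ones from the Device resource in this thread; the slot number, the block count, and the assumption that /dev/lto1 points to a non-rewinding device are illustrative guesses, and `reproduction_plan` is a hypothetical helper name.

```python
import shlex


def reproduction_plan(tape="/dev/lto1", changer="/dev/chg4", slot=4, drive=0,
                      blocks=1000, block_size=64512):
    """Return the manual test as a list of argv lists; nothing is executed.

    Assumptions (not from the thread): `tape` is a non-rewinding device,
    `slot` holds a scratch tape, and `blocks` is an arbitrary small count.
    """
    mt = ["mt", "-f", tape]
    return [
        mt + ["rewind"],
        # Write some data; the block size matches the 64512-byte writes in the log.
        ["dd", "if=/dev/zero", f"of={tape}", f"bs={block_size}", f"count={blocks}"],
        mt + ["weof", "1"],      # write one file mark
        mt + ["status"],         # note file and block number here
        mt + ["rewind"],
        mt + ["fsf", "1"],       # space forward over the first file
        mt + ["status"],
        mt + ["offline"],
        ["mtx", "-f", changer, "unload", str(slot), str(drive)],
        ["mtx", "-f", changer, "load", str(slot), str(drive)],
        mt + ["status"],         # position should be back at file 0, block 0
    ]


if __name__ == "__main__":
    for argv in reproduction_plan():
        print(shlex.join(argv))
```

Printing the plan first makes it easy to review each command against the actual library layout before running anything on a tape that matters.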

Post Rif: Re: Change tape problem 
<br><font size=2 face="sans-serif">Hi Arno,</font>
<br><font size=2 face="sans-serif">I ran some tests, and this is what I
think.</font>
<br><font size=2 face="sans-serif">When a tape change follows an
out-of-space error, subsequent block writes keep getting that error even
after the robot has changed the tape, and the error persists.</font>
<br><font size=2 face="sans-serif">I made some changes to the block.c routine
(very simple ones, because I'm not a C programmer and I don't know the
logic of the SD program). I made the routine enter the retry loop even for
ENOSPC when the file number is 0. This made bacula-sd work correctly (but it
took 20 hours to write file 0). After the EOF mark is written, speed is normal
again.</font>
<br><font size=2 face="sans-serif">My idea is that changing the tape does
not reset the EOD condition on the tape until a file mark is written. I
do not know whether this is a device fault or an OS fault, but I believe that the
tape's file descriptor should be closed and reopened on a tape change.</font>
<br><font size=2 face="sans-serif">dd and mt tests always gave correct
results, but dd always writes an EOF mark at the end of the transfer.</font>
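As a rough illustration of the workaround described above (retrying a write that fails with ENOSPC, the errno for "No space left on device", but only while still in file 0 of a freshly loaded tape), here is a small Python simulation. It is not Bacula's block.c logic: `FakeTape`, `write_block`, and the retry limit are hypothetical names and values chosen for this sketch.

```python
import errno


class FakeTape:
    """Stand-in for a tape device; not a real driver interface."""

    def __init__(self, stale_enospc_writes):
        # How many writes still fail with ENOSPC after the tape change.
        self.stale = stale_enospc_writes
        self.blocks = []

    def write(self, block):
        if self.stale > 0:
            self.stale -= 1
            raise OSError(errno.ENOSPC, "No space left on device")
        self.blocks.append(block)
        return len(block)


def write_block(dev, block, file_number, max_retries=3):
    """Write one block, retrying ENOSPC only at the start of a new tape.

    An ENOSPC in file 0 is treated as a possibly stale end-of-medium state;
    anywhere else it is taken as a genuine end of tape and re-raised.
    """
    attempts = 0
    while True:
        try:
            return dev.write(block)
        except OSError as exc:
            retryable = exc.errno == errno.ENOSPC and file_number == 0
            if not retryable or attempts >= max_retries:
                raise
            attempts += 1
```

A bounded retry count is used here so that a tape which really is full in file 0 still ends in an error instead of looping forever.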
<br>
<br><font size=2 face="sans-serif">If you have any ideas about this, I would
be very happy to hear them. Thank you very much in any case.</font>
<br>
<br><font size=2 face="sans-serif"> </font>
<br><font size=2 face="sans-serif">--------------------------------------------------------------------------<br>
Ferdinando Pasqualetti<br>
G.T.Dati srl<br>
Tel. 0557310862 - 3356172731 - Fax 055720143<br>
<br>
<br>
</font>
<br>
<br>
<br>
<table width=100%>
<tr valign=top>
<td width=40%><font size=1 face="sans-serif"><b>Ferdinando Pasqualetti/San
Lazzaro/Conserve Italia</b></font>
<p><font size=1 face="sans-serif">27/02/2007 09.47</font>
<td width=59%>
<table width=100%>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">To</font></div>
<td><font size=1 face="sans-serif">Arno Lehmann &lt;al < at > it...&gt;</font>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">CC</font></div>
<td><font size=1 face="sans-serif">bacula-users &lt;bacula-users < at > li...&gt;</font>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">Subject</font></div>
<td><font size=1 face="sans-serif">Rif: Re: [Bacula-users] Change tape
problem</font></table>
<br>
<table>
<tr valign=top>
<td>
<td></table>
<br></table>
<br>
<br><font size=2 face="sans-serif">Hi Arno,</font>
<br><font size=2 face="sans-serif">thank you very much for your answer.
I will try asap the tests you are suggesting. By the way, I purged the
volumes involved in the error shown in the original message (it was the
third try), restarted the backup job and here is the (correct) result.</font>
<br>
<br><font size=2><tt>25-feb 19:55 bacula-dir: Start Backup JobId 12927,
Job=webfs3-job.2007-02-25_19.55.40<br>
25-feb 19:55 bacula-dir: Recycled volume "web-004"<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: run command "/root/restartsmb"<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: Shutting down SMB services: [
OK ]<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: Starting SMB services: [ OK
]<br>
25-feb 19:55 webfs3: ClientRunBeforeJob: [ OK ]<br>
25-feb 19:55 bacula-sd: 3307 Issuing autochanger "unload slot 7, drive
0" command.<br>
25-feb 19:57 bacula-sd: 3304 Issuing autochanger "load slot 3, drive
0" command.<br>
25-feb 19:57 bacula-sd: 3305 Autochanger "load slot 3, drive 0",
status is OK.<br>
25-feb 19:57 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
25-feb 19:57 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 3.<br>
25-feb 19:57 bacula-sd: Recycled volume "web-004" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
webfs3: /proc is a different filesystem. Will not descend
from / into /proc<br>
webfs3: /boot is a different filesystem. Will not descend
from / into /boot<br>
webfs3: /dev is a different filesystem. Will not descend
from / into /dev<br>
webfs3: /var/lib/nfs/rpc_pipefs is a different filesystem.
Will not descend from / into /var/lib/nfs/rpc_pipefs<br>
webfs3: /sys is a different filesystem. Will not descend
from / into /sys<br>
webfs3: /uno is a different filesystem. Will not descend
from / into /uno<br>
26-feb 04:14 bacula-sd: End of Volume "web-004" at 594:6519 on
device "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
26-feb 04:14 bacula-sd: Re-read of last block succeeded.<br>
26-feb 04:14 bacula-sd: End of medium on Volume "web-004" Bytes=594,382,602,240
Blocks=9,213,519 at 26-feb-2007 04:14.<br>
26-feb 04:14 bacula-dir: Recycled volume "web-005"<br>
26-feb 04:14 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 04:14 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 3.<br>
26-feb 04:14 bacula-sd: 3307 Issuing autochanger "unload slot 3, drive
0" command.<br>
26-feb 04:15 bacula-sd: 3304 Issuing autochanger "load slot 4, drive
0" command.<br>
26-feb 04:15 bacula-sd: 3305 Autochanger "load slot 4, drive 0",
status is OK.<br>
26-feb 04:15 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 04:15 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 4.<br>
26-feb 04:15 bacula-sd: Recycled volume "web-005" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
26-feb 04:15 bacula-sd: New volume "web-005" mounted on device
"LTO1" (/dev/lto1) at 26-feb-2007 04:15.<br>
26-feb 10:21 bacula-sd: End of Volume "web-005" at 528:6656 on
device "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
26-feb 10:21 bacula-sd: Re-read of last block succeeded.<br>
26-feb 10:21 bacula-sd: End of medium on Volume "web-005" Bytes=528,395,664,384
Blocks=8,190,656 at 26-feb-2007 10:21.<br>
26-feb 10:21 bacula-dir: Recycled volume "web-006"<br>
26-feb 10:21 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 10:21 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 4.<br>
26-feb 10:21 bacula-sd: 3307 Issuing autochanger "unload slot 4, drive
0" command.<br>
26-feb 10:22 bacula-sd: 3304 Issuing autochanger "load slot 5, drive
0" command.<br>
26-feb 10:22 bacula-sd: 3305 Autochanger "load slot 5, drive 0",
status is OK.<br>
26-feb 10:22 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.<br>
26-feb 10:22 bacula-sd: 3302 Autochanger "loaded? drive 0", result
is Slot 5.<br>
26-feb 10:23 bacula-sd: Recycled volume "web-006" on device "LTO1"
(/dev/lto1), all previous data lost.<br>
26-feb 10:23 bacula-sd: New volume "web-006" mounted on device
"LTO1" (/dev/lto1) at 26-feb-2007 10:23.<br>
26-feb 13:49 bacula-sd: Job write elapsed time = 17:48:45, Transfer rate
= 21.65 M bytes/second<br>
26-feb 13:49 bacula-sd: Alert: SCSI 2 tape drive:<br>
26-feb 13:49 bacula-sd: Alert: File number=267, block number=0, partition=0.<br>
26-feb 13:49 bacula-sd: Alert: Tape block size 0 bytes. Density code 0x44
(no translation).<br>
26-feb 13:49 bacula-sd: Alert: Soft error count since last status=0<br>
26-feb 13:49 bacula-sd: Alert: General status bits on (81010000):<br>
26-feb 13:49 bacula-sd: Alert: EOF ONLINE IM_REP_EN<br>
26-feb 13:49 bacula-dir: Bacula 2.0.2 (28Jan07): 26-feb-2007 13:49:03<br>
JobId: 12927<br>
Job:
webfs3-job.2007-02-25_19.55.40<br>
Backup Level: Full<br>
Client:
"webfs3" 2.0.2 (28Jan07) i686-redhat-linux-gnu,redhat,Enterprise
release<br>
FileSet: "webfs3-fileset"
2005-04-30 07:13:53<br>
Pool:
"webfs" (From Job resource)<br>
Storage: "LTO-1"
(From user selection)<br>
Scheduled time: 25-feb-2007 19:55:17<br>
Start time: 25-feb-2007
19:55:46<br>
End time: 26-feb-2007
13:49:03<br>
Elapsed time: 17 hours 53 mins
17 secs<br>
Priority: 10<br>
FD Files Written: 4,046,880<br>
SD Files Written: 4,046,880<br>
FD Bytes Written: 1,387,910,783,372 (1.387
TB)<br>
SD Bytes Written: 1,388,589,182,436 (1.388
TB)<br>
Rate:
21552.4 KB/s<br>
Software Compression: None<br>
VSS:
no<br>
Encryption: no<br>
Volume name(s): web-004|web-005|web-006<br>
Volume Session Id: 1<br>
Volume Session Time: 1172427565<br>
Last Volume Bytes: 266,951,559,168 (266.9 GB)<br>
Non-fatal FD errors: 0<br>
SD Errors: 0<br>
FD termination status: OK<br>
SD termination status: OK<br>
Termination: Backup OK<br>
<br>
</tt></font>
<br><font size=2 face="sans-serif">The thing that is not in favour of an
hardware or OS problem is that with the same hardware and OS bacula 1.36.3
had not this problem, it arised with 1.38.11.</font>
<br><font size=2 face="sans-serif">The device setup is quite simple:</font>
<br>
<br>
<br><font size=2><tt>Device {</tt></font>
<br><font size=2><tt> Name = LTO1</tt></font>
<br><font size=2><tt> Media Type = LTO-3</tt></font>
<br><font size=2><tt> Archive Device = /dev/lto1</tt></font>
<br><font size=2><tt> AutomaticMount = yes;
# when device opened, read it</tt></font>
<br><font size=2><tt> AlwaysOpen = no;</tt></font>
<br><font size=2><tt> Autoselect = no</tt></font>
<br><font size=2><tt> RemovableMedia = yes;</tt></font>
<br><font size=2><tt> RandomAccess = no;</tt></font>
<br><font size=2><tt> Changer Command = "/etc/bacula/mtx-changer
%c %o %S %a %d"</tt></font>
<br><font size=2><tt> Changer Device = /dev/chg4</tt></font>
<br><font size=2><tt> Drive Index = 0</tt></font>
<br><font size=2><tt> AutoChanger = yes</tt></font>
<br><font size=2><tt> Alert Command = "sh -c 'mt -f %a status'"</tt></font>
<br><font size=2><tt> Maximum Network Buffer Size = 65536</tt></font>
<br><font size=2><tt>}</tt></font>
<br>
<br><font size=2 face="sans-serif">Devices /dev/lto1 and /dev/chg4 are
symlinks to real devices in order to manage hardware configuration changes.</font>
<br>
<br><font size=2 face="sans-serif">Thanks again</font>
<br>
<br><font size=2 face="sans-serif">--------------------------------------------------------------------------<br>
Ferdinando Pasqualetti<br>
G.T.Dati srl<br>
Tel. 0557310862 - 3356172731 - Fax 055720143<br>
<br>
<br>
</font>
<br>
<br>
<br>
<table width=100%>
<tr valign=top>
<td width=40%><font size=1 face="sans-serif"><b>Arno Lehmann &lt;al < at > it...&gt;</b>
</font>
<br><font size=1 face="sans-serif">Inviato da: bacula-users-bounces < at > li...</font>
<p><font size=1 face="sans-serif">26/02/2007 20.33</font>
<td width=59%>
<table width=100%>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">Per</font></div>
<td><font size=1 face="sans-serif">bacula-users &lt;bacula-users < at > li...&gt;</font>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">CC</font></div>
<td>
<tr valign=top>
<td>
<div align=right><font size=1 face="sans-serif">Oggetto</font></div>
<td><font size=1 face="sans-serif">Re: [Bacula-users] Change tape problem</font></table>
<br>
<table>
<tr valign=top>
<td>
<td></table>
<br></table>
<br>
<br>
<br><font size=2><tt>Hello,<br>
<br>
On 2/26/2007 10:54 AM, Ferdinando Pasqualetti wrote:<br>
&gt; <br>
&gt; Hi Bacula users,<br>
&gt; sorry if you get this message two times, I sent it with a wrong sender
<br>
&gt; (not in the list), so I am sending it again.<br>
&gt; I am facing a problem that came out with rev. 1.38.11 (I never saw
it <br>
&gt; with 1.36.3). The problem did not happen all times, but very often.
Now <br>
&gt; I switched to 2.0.2 and this problem is much more frequent.<br>
&gt; The problem is that when a tape was exhausted bacula changes correctly
<br>
&gt; the tape in the autochanger drive but just after get this error:<br>
&gt; <br>
&gt; 25-feb 02:47 bacula-sd: End of Volume "web-004" at 594:3362
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 25-feb 02:47 bacula-sd: Re-read of last block succeeded.<br>
&gt; 25-feb 02:47 bacula-sd: End of medium on Volume "web-004"
<br>
&gt; Bytes=594,178,937,856 Blocks=9,210,362 at 25-feb-2007 02:47.<br>
&gt; 25-feb 02:47 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 02:47 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 25-feb 02:47 bacula-sd: 3307 Issuing autochanger "unload slot
3, drive <br>
&gt; 0" command.<br>
&gt; 25-feb 02:48 bacula-sd: 3304 Issuing autochanger "load slot 4,
drive 0" <br>
&gt; command.<br>
&gt; 25-feb 02:48 bacula-sd: 3305 Autochanger "load slot 4, drive
0", status <br>
&gt; is OK.<br>
&gt; 25-feb 02:48 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 02:48 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 25-feb 02:49 bacula-sd: Wrote label to prelabeled Volume "web-005"
on <br>
&gt; device "LTO1" (/dev/lto1)<br>
&gt; 25-feb 02:49 bacula-sd: New volume "web-005" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 25-feb-2007 02:49.<br>
&gt; 25-feb 02:49 bacula-sd: End of Volume "web-005" at 0:1 on
device "LTO1" <br>
&gt; (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 25-feb 02:49 bacula-sd: webfs3-job.2007-02-24_20.03.22 Error: Re-read
of <br>
&gt; last block OK, but block numbers differ. Last block=0 Current block=9210362.<br>
&gt; 25-feb 02:49 bacula-sd: Job write elapsed time = 06:43:26, Transfer
rate <br>
&gt; = 24.52 M bytes/second<br>
&gt; 25-feb 02:49 webfs3: webfs3-job.2007-02-24_20.03.22 Fatal error: <br>
&gt; backup.c:860 Network send error to SD. ERR=Pipe rotta<br>
&gt; 25-feb 02:49 bacula-dir: webfs3-job.2007-02-24_20.03.22 Error: Bacula
<br>
&gt; 2.0.2 (28Jan07): 25-feb-2007 02:49:25<br>
&gt; <br>
&gt; It seems there are two problems, the first one (and the most important
<br>
&gt; one) is that bacula get an end of volume on the new tape,<br>
<br>
What Bacula reports as an EOT can be caused by a drive error, too, so <br>
for the time being I assume that the second error is tightly related to
<br>
this one.<br>
<br>
&gt; and the second <br>
&gt; one is the difference in the last block (it appears to be the last
block <br>
&gt; of the previous tape).<br>
<br>
If that's the case, and your description seems quite clear, you might <br>
have found an OS or hardware bug, too.<br>
<br>
This is only guesswork, but it could be possible that, after a tape <br>
change, the hardware or the tape driver don't update their state <br>
information.<br>
<br>
If that's the case, you could try the following:<br>
- first, have a look at your system log and dmesg output. There might be
<br>
errors reported there.<br>
- second, try to reproduce the problem without using Bacula. Unmount the
<br>
tape drive from bconsole. Load a tape (an unused one, or one with write
<br>
protection). If you use an empty tape, write some data and some file <br>
marks to it, ending with an EOT mark. dd and mt are tools for that purpose.<br>
Then, use tapeinfo or st to observe the tape status, especialy the block
<br>
position reported, when doing some rewinds, fast forwards, offline, and
<br>
see what happens after you used mtx to unload and reload that tape.<br>
<br>
If there really is a problem with the hardware or the OD driver, you <br>
should be able to reproduce it then. Updating the drive firmware and the
<br>
OS (or, if that's up to date, filing a bug report) would be two options
<br>
then.<br>
<br>
Otherwise, you should run btape again, because there are some things in
<br>
the report I don't like - errors writing the last block to tape should
<br>
not happen with current hardware, for example. You might try to tune <br>
your device configuration, and perhaps you'll have to set the tape <br>
driver to a different write mode. Suggesting something is difficult <br>
without seeing how it's setup now Smile<br>
<br>
&gt; Bacula is a MySQL version on a RedHat AS 4.04, rpmbuilt on that system,
<br>
&gt; an HP proliant G3 3.2 Ghz, 2Gb RAM.<br>
&gt; The tape is an MSL6000 with two LTO-3 drives, drived by bacula directly
<br>
&gt; (not using the autochanger as device - 1.36.3 setup).<br>
&gt; Btape tests run correctly, including the "fill and change tape"
(I am <br>
&gt; attaching the test result, if someone is interested).<br>
&gt; Did anyone get a similar problem?<br>
<br>
That basic setup should run ok I think... nothing unusual there.<br>
<br>
Arno<br>
<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; ------------------------------------------------------------------------<br>
&gt; <br>
&gt; -------------------------------------------------------------------------<br>
&gt; Take Surveys. Earn Cash. Influence the Future of IT<br>
&gt; Join SourceForge.net's Techsay panel and you'll get the chance to
share your<br>
&gt; opinions on IT &amp; business topics through brief surveys-and earn
cash<br>
&gt; http://www.techsay.com/default.php?page=join.php&amp;p=sourceforge&amp;CID=DEVDEV<br>
&gt; <br>
&gt; <br>
&gt; ------------------------------------------------------------------------<br>
&gt; <br>
&gt; _______________________________________________<br>
&gt; Bacula-users mailing list<br>
&gt; Bacula-users < at > li...<br>
&gt; https://lists.sourceforge.net/lists/listinfo/bacula-users<br>
<br>
-- <br>
IT-Service Lehmann
al < at > it...<br>
Arno Lehmann http://www.its-lehmann.de<br>
<br>
</tt></font>
<br>
<br>

Post Rif: Re: Change tape problem 
Hi,

On 3/11/2007 6:33 PM, Ferdinando Pasqualetti wrote:

Hi Arno,
I made some tests and this is what I think.
After a tape change that follows an out-of-space error, subsequent block
writes continue to get that error even after the robot has changed the
tape. This persists.
I made some changes to the block.c routine

You'd better discuss this at bacula-devel, I think, or send Kern a mail
explaining the problem and the resolution.

(very simple, because I'm not
a C programmer and also I don't know the logic of the SD program). I made
the routine enter the retry loop even for ENOSPC if the file number is 0.
This made bacula-sd work correctly (but it took 20 hours to write file
0). After writing the EOF mark, speed is normal again.
My idea is that changing the tape does not reset the EOD condition on
the tape

That sounds like a bug, either in the hardware or the HBA driver.

until a file mark is written. I do not know whether this is a device
or an OS error, but I believe that the file descriptor of the tape should
be closed and reopened on a tape change.

I know next to nothing about these details, so I won't comment on it...

dd and mt tests always gave correct results, but dd always writes an EOF
mark at the end of the transfer.

If you have any ideas about this I will be very happy. Thank you very
much in any case.

Difficult problem, I think.

If this is a hardware or driver problem, I don't think modifying the SD
code is the right solution.

If it works for you - fine, but it might be that you have to manage that
patch for your installation yourself.

Arno


--------------------------------------------------------------------------
Ferdinando Pasqualetti
G.T.Dati srl
Tel. 0557310862 - 3356172731 - Fax 055720143





*Ferdinando Pasqualetti/San Lazzaro/Conserve Italia*
27/02/2007 09.47

To: Arno Lehmann <al < at > it...>
CC: bacula-users <bacula-users < at > li...>
Subject: Rif: Re: [Bacula-users] Change tape problem

Hi Arno,
thank you very much for your answer. I will try asap the tests you are
suggesting. By the way, I purged the volumes involved in the error shown
in the original message (it was the third try), restarted the backup job
and here is the (correct) result.

25-feb 19:55 bacula-dir: Start Backup JobId 12927,
Job=webfs3-job.2007-02-25_19.55.40
25-feb 19:55 bacula-dir: Recycled volume "web-004"
25-feb 19:55 webfs3: ClientRunBeforeJob: run command "/root/restartsmb"
25-feb 19:55 webfs3: ClientRunBeforeJob: Shutting down SMB services: [
OK ]
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato
25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato
25-feb 19:55 webfs3: ClientRunBeforeJob: Starting SMB services: [ OK ]
25-feb 19:55 webfs3: ClientRunBeforeJob: [ OK ]
25-feb 19:55 bacula-sd: 3307 Issuing autochanger "unload slot 7, drive
0" command.
25-feb 19:57 bacula-sd: 3304 Issuing autochanger "load slot 3, drive 0"
command.
25-feb 19:57 bacula-sd: 3305 Autochanger "load slot 3, drive 0", status
is OK.
25-feb 19:57 bacula-sd: 3301 Issuing autochanger "loaded? drive 0" command.
25-feb 19:57 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 3.
25-feb 19:57 bacula-sd: Recycled volume "web-004" on device "LTO1"
(/dev/lto1), all previous data lost.
webfs3: /proc is a different filesystem. Will not descend from /
into /proc
webfs3: /boot is a different filesystem. Will not descend from /
into /boot
webfs3: /dev is a different filesystem. Will not descend from /
into /dev
webfs3: /var/lib/nfs/rpc_pipefs is a different filesystem. Will not
descend from / into /var/lib/nfs/rpc_pipefs
webfs3: /sys is a different filesystem. Will not descend from /
into /sys
webfs3: /uno is a different filesystem. Will not descend from /
into /uno
26-feb 04:14 bacula-sd: End of Volume "web-004" at 594:6519 on device
"LTO1" (/dev/lto1). Write of 64512 bytes got -1.
26-feb 04:14 bacula-sd: Re-read of last block succeeded.
26-feb 04:14 bacula-sd: End of medium on Volume "web-004"
Bytes=594,382,602,240 Blocks=9,213,519 at 26-feb-2007 04:14.
26-feb 04:14 bacula-dir: Recycled volume "web-005"
26-feb 04:14 bacula-sd: 3301 Issuing autochanger "loaded? drive 0" command.
26-feb 04:14 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 3.
26-feb 04:14 bacula-sd: 3307 Issuing autochanger "unload slot 3, drive
0" command.
26-feb 04:15 bacula-sd: 3304 Issuing autochanger "load slot 4, drive 0"
command.
26-feb 04:15 bacula-sd: 3305 Autochanger "load slot 4, drive 0", status
is OK.
26-feb 04:15 bacula-sd: 3301 Issuing autochanger "loaded? drive 0" command.
26-feb 04:15 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 4.
26-feb 04:15 bacula-sd: Recycled volume "web-005" on device "LTO1"
(/dev/lto1), all previous data lost.
26-feb 04:15 bacula-sd: New volume "web-005" mounted on device "LTO1"
(/dev/lto1) at 26-feb-2007 04:15.
26-feb 10:21 bacula-sd: End of Volume "web-005" at 528:6656 on device
"LTO1" (/dev/lto1). Write of 64512 bytes got -1.
26-feb 10:21 bacula-sd: Re-read of last block succeeded.
26-feb 10:21 bacula-sd: End of medium on Volume "web-005"
Bytes=528,395,664,384 Blocks=8,190,656 at 26-feb-2007 10:21.
26-feb 10:21 bacula-dir: Recycled volume "web-006"
26-feb 10:21 bacula-sd: 3301 Issuing autochanger "loaded? drive 0" command.
26-feb 10:21 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 4.
26-feb 10:21 bacula-sd: 3307 Issuing autochanger "unload slot 4, drive
0" command.
26-feb 10:22 bacula-sd: 3304 Issuing autochanger "load slot 5, drive 0"
command.
26-feb 10:22 bacula-sd: 3305 Autochanger "load slot 5, drive 0", status
is OK.
26-feb 10:22 bacula-sd: 3301 Issuing autochanger "loaded? drive 0" command.
26-feb 10:22 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 5.
26-feb 10:23 bacula-sd: Recycled volume "web-006" on device "LTO1"
(/dev/lto1), all previous data lost.
26-feb 10:23 bacula-sd: New volume "web-006" mounted on device "LTO1"
(/dev/lto1) at 26-feb-2007 10:23.
26-feb 13:49 bacula-sd: Job write elapsed time = 17:48:45, Transfer rate
= 21.65 M bytes/second
26-feb 13:49 bacula-sd: Alert: SCSI 2 tape drive:
26-feb 13:49 bacula-sd: Alert: File number=267, block number=0, partition=0.
26-feb 13:49 bacula-sd: Alert: Tape block size 0 bytes. Density code
0x44 (no translation).
26-feb 13:49 bacula-sd: Alert: Soft error count since last status=0
26-feb 13:49 bacula-sd: Alert: General status bits on (81010000):
26-feb 13:49 bacula-sd: Alert: EOF ONLINE IM_REP_EN
26-feb 13:49 bacula-dir: Bacula 2.0.2 (28Jan07): 26-feb-2007 13:49:03
JobId: 12927
Job: webfs3-job.2007-02-25_19.55.40
Backup Level: Full
Client: "webfs3" 2.0.2 (28Jan07)
i686-redhat-linux-gnu,redhat,Enterprise release
FileSet: "webfs3-fileset" 2005-04-30 07:13:53
Pool: "webfs" (From Job resource)
Storage: "LTO-1" (From user selection)
Scheduled time: 25-feb-2007 19:55:17
Start time: 25-feb-2007 19:55:46
End time: 26-feb-2007 13:49:03
Elapsed time: 17 hours 53 mins 17 secs
Priority: 10
FD Files Written: 4,046,880
SD Files Written: 4,046,880
FD Bytes Written: 1,387,910,783,372 (1.387 TB)
SD Bytes Written: 1,388,589,182,436 (1.388 TB)
Rate: 21552.4 KB/s
Software Compression: None
VSS: no
Encryption: no
Volume name(s): web-004|web-005|web-006
Volume Session Id: 1
Volume Session Time: 1172427565
Last Volume Bytes: 266,951,559,168 (266.9 GB)
Non-fatal FD errors: 0
SD Errors: 0
FD termination status: OK
SD termination status: OK
Termination: Backup OK
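
The figures in the job report above can be cross-checked with a few lines of Python. This is only a sanity check on numbers copied from the log. It assumes that "M bytes" means 10^6 bytes and "KB" means 1000 bytes (the results agree with the report under that assumption). It also shows that, for the two full volumes, the reported byte count equals (Blocks + 1) x 64512; the reason for the extra block is not stated in this thread.

```python
BLOCK_SIZE = 64512  # bytes per block, from "Write of 64512 bytes" in the log


def seconds(hours, minutes, secs):
    """Convert an h:m:s duration from the report into seconds."""
    return hours * 3600 + minutes * 60 + secs


def rate(byte_count, elapsed_seconds, unit):
    """Average throughput in multiples of `unit` bytes per second."""
    return byte_count / elapsed_seconds / unit


# SD line: "Job write elapsed time = 17:48:45, Transfer rate = 21.65 M bytes/second"
sd_rate_mb = rate(1_388_589_182_436, seconds(17, 48, 45), 1_000_000)

# Director summary: "Elapsed time: 17 hours 53 mins 17 secs", "Rate: 21552.4 KB/s"
fd_rate_kb = rate(1_387_910_783_372, seconds(17, 53, 17), 1_000)

# (Bytes, Blocks) pairs from the "End of medium on Volume" lines
volumes = {
    "web-004": (594_382_602_240, 9_213_519),
    "web-005": (528_395_664_384, 8_190_656),
}


def bytes_match_blocks(byte_count, blocks):
    """True if the byte count is exactly (blocks + 1) full-size blocks."""
    return byte_count == (blocks + 1) * BLOCK_SIZE


print(round(sd_rate_mb, 2), "M bytes/second")
print(round(fd_rate_kb, 1), "KB/s")
for name, (byte_count, blocks) in volumes.items():
    print(name, bytes_match_blocks(byte_count, blocks))
```

Since both rates and both volume sizes are consistent, the successful run gives no hint of lost or short blocks; the failure seems confined to the first write after a tape change.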


What speaks against a hardware or OS problem is that, with the same
hardware and OS, Bacula 1.36.3 did not have this problem; it first
appeared with 1.38.11.
The device setup is quite simple:


Device {
Name = LTO1
Media Type = LTO-3
Archive Device = /dev/lto1
AutomaticMount = yes; # when device opened, read it
AlwaysOpen = no;
Autoselect = no
RemovableMedia = yes;
RandomAccess = no;
Changer Command = "/etc/bacula/mtx-changer %c %o %S %a %d"
Changer Device = /dev/chg4
Drive Index = 0
AutoChanger = yes
Alert Command = "sh -c 'mt -f %a status'"
Maximum Network Buffer Size = 65536
}

Devices /dev/lto1 and /dev/chg4 are symlinks to real devices in order to
manage hardware configuration changes.

Thanks again

--------------------------------------------------------------------------
Ferdinando Pasqualetti
G.T.Dati srl
Tel. 0557310862 - 3356172731 - Fax 055720143





*Arno Lehmann <al < at > it...>*
Sent by: bacula-users-bounces < at > li...
26/02/2007 20.33

To: bacula-users <bacula-users < at > li...>
CC:
Subject: Re: [Bacula-users] Change tape problem

Hello,

On 2/26/2007 10:54 AM, Ferdinando Pasqualetti wrote:

Hi Bacula users,
sorry if you get this message twice; I first sent it from a wrong sender
address (not subscribed to the list), so I am sending it again.
I am facing a problem that first appeared with rev. 1.38.11 (I never saw it
with 1.36.3). It did not happen every time, but very often. Now that
I have switched to 2.0.2, it is much more frequent.
The problem is that when a tape is full, Bacula correctly changes
the tape in the autochanger drive but immediately afterwards gets this error:

25-feb 02:47 bacula-sd: End of Volume "web-004" at 594:3362 on device
"LTO1" (/dev/lto1). Write of 64512 bytes got -1.
25-feb 02:47 bacula-sd: Re-read of last block succeeded.
25-feb 02:47 bacula-sd: End of medium on Volume "web-004"
Bytes=594,178,937,856 Blocks=9,210,362 at 25-feb-2007 02:47.
25-feb 02:47 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.
25-feb 02:47 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 3.
25-feb 02:47 bacula-sd: 3307 Issuing autochanger "unload slot 3, drive
0" command.
25-feb 02:48 bacula-sd: 3304 Issuing autochanger "load slot 4, drive 0"
command.
25-feb 02:48 bacula-sd: 3305 Autochanger "load slot 4, drive 0", status
is OK.
25-feb 02:48 bacula-sd: 3301 Issuing autochanger "loaded? drive 0"
command.
25-feb 02:48 bacula-sd: 3302 Autochanger "loaded? drive 0", result is
Slot 4.
25-feb 02:49 bacula-sd: Wrote label to prelabeled Volume "web-005" on
device "LTO1" (/dev/lto1)
25-feb 02:49 bacula-sd: New volume "web-005" mounted on device "LTO1"
(/dev/lto1) at 25-feb-2007 02:49.
25-feb 02:49 bacula-sd: End of Volume "web-005" at 0:1 on device "LTO1"
(/dev/lto1). Write of 64512 bytes got -1.
25-feb 02:49 bacula-sd: webfs3-job.2007-02-24_20.03.22 Error: Re-read of
last block OK, but block numbers differ. Last block=0 Current
block=9210362.
25-feb 02:49 bacula-sd: Job write elapsed time = 06:43:26, Transfer rate
= 24.52 M bytes/second
25-feb 02:49 webfs3: webfs3-job.2007-02-24_20.03.22 Fatal error:
backup.c:860 Network send error to SD. ERR=Pipe rotta
25-feb 02:49 bacula-dir: webfs3-job.2007-02-24_20.03.22 Error: Bacula
2.0.2 (28Jan07): 25-feb-2007 02:49:25

It seems there are two problems: the first one (and the most important
one) is that Bacula gets an end of volume on the new tape,

What Bacula reports as an EOT can be caused by a drive error, too, so
for the time being I assume that the second error is tightly related to
this one.

and the second
one is the difference in the last block (it appears to be the last block
of the previous tape).

If that's the case, and your description seems quite clear, you might
have found an OS or hardware bug, too.

This is only guesswork, but it could be possible that, after a tape
change, the hardware or the tape driver don't update their state
information.

If that's the case, you could try the following:
- first, have a look at your system log and dmesg output. There might be
errors reported there.
- second, try to reproduce the problem without using Bacula. Unmount the
tape drive from bconsole. Load a tape (an unused one, or one with write
protection). If you use an empty tape, write some data and some file
marks to it, ending with an EOT mark. dd and mt are tools for that purpose.
Then, use tapeinfo or st to observe the tape status, especially the block
position reported, when doing some rewinds, fast forwards, offline, and
see what happens after you used mtx to unload and reload that tape.

If there really is a problem with the hardware or the OS driver, you
should be able to reproduce it then. Updating the drive firmware and the
OS (or, if that's up to date, filing a bug report) would be two options
then.
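
The manual test described above could be laid out as a command sequence like the one below. This is only a sketch in Python that prints the commands; nothing is executed. The device paths /dev/lto1 and /dev/chg4 come from the Device resource quoted in this thread, while slot 4, drive 0, the 1000-block count and the exact order of the steps are assumptions to adapt. It also assumes /dev/lto1 points at a non-rewinding device node; with a rewinding node the tape rewinds after each command and the reported positions will differ.

```python
import shlex


def tape_test_plan(tape="/dev/lto1", changer="/dev/chg4",
                   slot=4, drive=0, block_size=64512, blocks=1000):
    """Return the shell commands for a Bacula-free tape change test.

    WARNING: the dd step overwrites the tape, so use a scratch tape only.
    Slot, drive and block count are illustrative defaults, not values
    prescribed anywhere in the thread.
    """
    status = ["mt", "-f", tape, "status"]
    return [
        ["mt", "-f", tape, "rewind"],
        # Write some data, then a file mark
        ["dd", "if=/dev/zero", f"of={tape}", f"bs={block_size}", f"count={blocks}"],
        ["mt", "-f", tape, "weof", "1"],
        status,
        # Move around and watch the file/block position reported each time
        ["mt", "-f", tape, "rewind"],
        status,
        ["mt", "-f", tape, "fsf", "1"],
        status,
        # Eject, then let the changer unload and reload the same tape
        ["mt", "-f", tape, "offline"],
        ["mtx", "-f", changer, "unload", str(slot), str(drive)],
        ["mtx", "-f", changer, "load", str(slot), str(drive)],
        # After the reload the drive should report the beginning of the tape
        status,
    ]


if __name__ == "__main__":
    for command in tape_test_plan():
        print(shlex.join(command))
```

If the status after the reload still shows the old file or block position, or an end-of-tape condition, that would point at the drive or the driver rather than at Bacula.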

Otherwise, you should run btape again, because there are some things in
the report I don't like - errors writing the last block to tape should
not happen with current hardware, for example. You might try to tune
your device configuration, and perhaps you'll have to set the tape
driver to a different write mode. Suggesting something is difficult
without seeing how it's set up now :-)

Bacula is a MySQL version on a RedHat AS 4.04, rpmbuilt on that system,
an HP ProLiant G3 3.2 GHz, 2 GB RAM.
The tape is an MSL6000 with two LTO-3 drives, driven by Bacula directly
(not using the autochanger as device - 1.36.3 setup).
Btape tests run correctly, including the "fill and change tape" (I am
attaching the test result, if someone is interested).
Did anyone get a similar problem?

That basic setup should run ok I think... nothing unusual there.

Arno

--------------------------------------------------------------------------
Ferdinando Pasqualetti
G.T.Dati srl
Tel. 0557310862 - 3356172731 - Fax 055720143




--
IT-Service Lehmann al < at > it...
Arno Lehmann http://www.its-lehmann.de


--
IT-Service Lehmann al < at > it...
Arno Lehmann http://www.its-lehmann.de

Post Rif: Re: Change tape problem 
<br><font size=2 face="sans-serif">Hi everybody,</font>
<br><font size=2 face="sans-serif">I am following up on Arno's suggestion
regarding a problem with writing additional tapes that started with 1.38.11
and persists in 2.0.2. Arno's idea is that the problem could lie in the tape
hardware or the OS, and maybe he is right, but the same hardware and OS did
not have the problem with 1.36.3 (a hardware failure could also have occurred
in the meantime, but apparently everything else works).</font>
<br><font size=2 face="sans-serif">What I think happens is this: once an
ENOSPC error has occurred while writing a tape block, the last block is
correctly re-read and the autochanger swaps in a prelabelled tape; then the
label is rewritten (or skipped, I do not know), and when the next block is
written on the new tape it gets the ENOSPC error again. This is reported again
as an EOT, but the last-block check fails and the job fails too.</font>
<br><font size=2 face="sans-serif">I have also made a very small modification
to the block.c routine (I am not a programmer, let alone a C programmer,
so this change is surely not the correct fix, even if it solves
the problem in some way). The change is:</font>
<br>
<br><font size=2 face="sans-serif">532d531</font>
<br><font size=2 face="sans-serif">&lt; if (dev-&gt;file ==
0) { dev-&gt;clrerror(-1); }</font>
<br>
<br><font size=2 face="sans-serif">that simply means (I hope) "clear
all errors before writing the block if this is the first tape file"</font>
<br><font size=2 face="sans-serif">This is needed because the write keeps
getting the error until an EOF is written. Because a file is normally around
1 GB, this slows performance down to 20-25% during this phase, but I can
deal with it.</font>
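
To make the intent of the one-line change easier to follow, here is a toy model in Python. It is not Bacula code: FakeTapeDevice, clear_error and write_block_with_patch are hypothetical names, and the stale flag merely imitates the behaviour described in this thread (writes keep failing with ENOSPC after a tape change until the error state is cleared).

```python
import errno


class FakeTapeDevice:
    """Hypothetical stand-in for a tape drive right after a tape change."""

    def __init__(self):
        self.file = 0              # current tape file number (0 = first file)
        self.stale_enospc = True   # error state left over from the previous tape
        self.blocks_written = 0

    def clear_error(self):
        """Imitates the effect of dev->clrerror(-1) in the patch."""
        self.stale_enospc = False

    def write_block(self, data):
        """Fail with ENOSPC while the stale error state is still set."""
        if self.stale_enospc:
            raise OSError(errno.ENOSPC, "No space left on device")
        self.blocks_written += 1
        return len(data)


def write_block_with_patch(dev, data):
    """Clear any pending error before writing, but only on tape file 0."""
    if dev.file == 0:
        dev.clear_error()
    return dev.write_block(data)
```

In this model an unpatched write on the fresh tape raises ENOSPC, while the patched path succeeds. Whether clearing the error is safe on a tape that really is full is a separate question that the model does not answer.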
<br>
<br><font size=2 face="sans-serif">What I would like to know is whether,
in your opinion, this behaviour is really a hardware/firmware/OS problem,
and whether the block routine could be made more resilient in some way (an EOF
mark at tape end, closing and reopening the fd, or whatever).</font>
<br>
<br><font size=2 face="sans-serif">Many thanks if you would like to give
me an answer, and many thanks anyway for this great package.</font>
<br>
<br>
<br>
<br><font size=2 face="sans-serif">--------------------------------------------------------------------------<br>
Ferdinando Pasqualetti<br>
G.T.Dati srl<br>
Tel. 0557310862 - 3356172731 - Fax 055720143<br>
<br>
<br>
</font>
<br><font size=2><tt>Hi,<br>
<br>
On 3/11/2007 6:33 PM, Ferdinando Pasqualetti wrote:<br>
&gt; <br>
&gt; Hi Arno,<br>
&gt; I made some tests and this is what I think.<br>
&gt; After a tape change that follows an out-of-space error, subsequent block <br>
&gt; writes continue to get that error even after the robot has changed the <br>
&gt; tape. This persists.<br>
&gt; I made some changes to the block.c routine<br>
<br>
You'd better discuss this at bacula-devel, I think, or send Kern a mail
<br>
explaining the problem and the resolution.<br>
<br>
&gt; (very simple, because I'm not <br>
&gt; a C programmer and also I don't know the logic of the SD program). I made <br>
&gt; the routine enter the retry loop even for ENOSPC if the file number is 0. <br>
&gt; This made bacula-sd work correctly (but it took 20 hours to write file <br>
&gt; 0). After writing the EOF mark, speed is normal again.<br>
&gt; My idea is that changing the tape does not reset the EOD condition
on <br>
&gt; the tape<br>
<br>
That sounds like a bug, either in the hardware or the HBA driver.<br>
<br>
&gt; until a file mark is written. I do not know whether this is a device <br>
&gt; or an OS error, but I believe that the file descriptor of the tape should <br>
&gt; be closed and reopened on a tape change.<br>
<br>
I know next to nothing about these details, so I won't comment on it...<br>
<br>
&gt; dd and mt tests always gave correct results, but dd always writes an EOF <br>
&gt; mark at the end of the transfer.<br>
&gt; <br>
&gt; If you have any ideas about this I will be very happy. Thank you very <br>
&gt; much in any case.<br>
<br>
Difficult problem, I think.<br>
<br>
If this is a hardware or driver problem, I don't think modifying the SD
<br>
code is the right solution.<br>
<br>
If it works for you - fine, but it might be that you have to manage that
<br>
patch for your installation yourself.<br>
<br>
Arno<br>
<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; *Ferdinando Pasqualetti/San Lazzaro/Conserve Italia*<br>
&gt; 27/02/2007 09.47<br>
&gt; <br>
&gt; To: Arno Lehmann &lt;al < at > it...&gt;<br>
&gt; CC: bacula-users &lt;bacula-users < at > li...&gt;<br>
&gt; Subject: Rif: Re: [Bacula-users] Change tape problem<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; Hi Arno,<br>
&gt; thank you very much for your answer. I will try asap the tests you
are <br>
&gt; suggesting. By the way, I purged the volumes involved in the error
shown <br>
&gt; in the original message (it was the third try), restarted the backup
job <br>
&gt; and here is the (correct) result.<br>
&gt; <br>
&gt; 25-feb 19:55 bacula-dir: Start Backup JobId 12927, <br>
&gt; Job=webfs3-job.2007-02-25_19.55.40<br>
&gt; 25-feb 19:55 bacula-dir: Recycled volume "web-004"<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: run command "/root/restartsmb"<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: Shutting down SMB services:
[ <br>
&gt; OK ]<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: Starting SMB services: [
OK ]<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: [ OK ]<br>
&gt; 25-feb 19:55 bacula-sd: 3307 Issuing autochanger "unload slot
7, drive <br>
&gt; 0" command.<br>
&gt; 25-feb 19:57 bacula-sd: 3304 Issuing autochanger "load slot 3,
drive 0" <br>
&gt; command.<br>
&gt; 25-feb 19:57 bacula-sd: 3305 Autochanger "load slot 3, drive
0", status <br>
&gt; is OK.<br>
&gt; 25-feb 19:57 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 19:57 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 25-feb 19:57 bacula-sd: Recycled volume "web-004" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; webfs3: /proc is a different filesystem. Will
not descend from / <br>
&gt; into /proc<br>
&gt; webfs3: /boot is a different filesystem. Will
not descend from / <br>
&gt; into /boot<br>
&gt; webfs3: /dev is a different filesystem. Will not
descend from / <br>
&gt; into /dev<br>
&gt; webfs3: /var/lib/nfs/rpc_pipefs is a different
filesystem. Will not <br>
&gt; descend from / into /var/lib/nfs/rpc_pipefs<br>
&gt; webfs3: /sys is a different filesystem. Will not
descend from / <br>
&gt; into /sys<br>
&gt; webfs3: /uno is a different filesystem. Will not
descend from / <br>
&gt; into /uno<br>
&gt; 26-feb 04:14 bacula-sd: End of Volume "web-004" at 594:6519
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 26-feb 04:14 bacula-sd: Re-read of last block succeeded.<br>
&gt; 26-feb 04:14 bacula-sd: End of medium on Volume "web-004"
<br>
&gt; Bytes=594,382,602,240 Blocks=9,213,519 at 26-feb-2007 04:14.<br>
&gt; 26-feb 04:14 bacula-dir: Recycled volume "web-005"<br>
&gt; 26-feb 04:14 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 04:14 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 26-feb 04:14 bacula-sd: 3307 Issuing autochanger "unload slot
3, drive <br>
&gt; 0" command.<br>
&gt; 26-feb 04:15 bacula-sd: 3304 Issuing autochanger "load slot 4,
drive 0" <br>
&gt; command.<br>
&gt; 26-feb 04:15 bacula-sd: 3305 Autochanger "load slot 4, drive
0", status <br>
&gt; is OK.<br>
&gt; 26-feb 04:15 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 04:15 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 26-feb 04:15 bacula-sd: Recycled volume "web-005" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; 26-feb 04:15 bacula-sd: New volume "web-005" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 26-feb-2007 04:15.<br>
&gt; 26-feb 10:21 bacula-sd: End of Volume "web-005" at 528:6656
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 26-feb 10:21 bacula-sd: Re-read of last block succeeded.<br>
&gt; 26-feb 10:21 bacula-sd: End of medium on Volume "web-005"
<br>
&gt; Bytes=528,395,664,384 Blocks=8,190,656 at 26-feb-2007 10:21.<br>
&gt; 26-feb 10:21 bacula-dir: Recycled volume "web-006"<br>
&gt; 26-feb 10:21 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 10:21 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 26-feb 10:21 bacula-sd: 3307 Issuing autochanger "unload slot
4, drive <br>
&gt; 0" command.<br>
&gt; 26-feb 10:22 bacula-sd: 3304 Issuing autochanger "load slot 5,
drive 0" <br>
&gt; command.<br>
&gt; 26-feb 10:22 bacula-sd: 3305 Autochanger "load slot 5, drive
0", status <br>
&gt; is OK.<br>
&gt; 26-feb 10:22 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 10:22 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 5.<br>
&gt; 26-feb 10:23 bacula-sd: Recycled volume "web-006" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; 26-feb 10:23 bacula-sd: New volume "web-006" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 26-feb-2007 10:23.<br>
&gt; 26-feb 13:49 bacula-sd: Job write elapsed time = 17:48:45, Transfer
rate <br>
&gt; = 21.65 M bytes/second<br>
&gt; 26-feb 13:49 bacula-sd: Alert: SCSI 2 tape drive:<br>
&gt; 26-feb 13:49 bacula-sd: Alert: File number=267, block number=0, partition=0.<br>
&gt; 26-feb 13:49 bacula-sd: Alert: Tape block size 0 bytes. Density code
<br>
&gt; 0x44 (no translation).<br>
&gt; 26-feb 13:49 bacula-sd: Alert: Soft error count since last status=0<br>
&gt; 26-feb 13:49 bacula-sd: Alert: General status bits on (81010000):<br>
&gt; 26-feb 13:49 bacula-sd: Alert: EOF ONLINE IM_REP_EN<br>
&gt; 26-feb 13:49 bacula-dir: Bacula 2.0.2 (28Jan07): 26-feb-2007 13:49:03<br>
&gt; JobId:
12927<br>
&gt; Job:
webfs3-job.2007-02-25_19.55.40<br>
&gt; Backup Level: Full<br>
&gt; Client:
"webfs3" 2.0.2 (28Jan07) <br>
&gt; i686-redhat-linux-gnu,redhat,Enterprise release<br>
&gt; FileSet: "webfs3-fileset"
2005-04-30 07:13:53<br>
&gt; Pool:
"webfs" (From Job resource)<br>
&gt; Storage: "LTO-1"
(From user selection)<br>
&gt; Scheduled time: 25-feb-2007 19:55:17<br>
&gt; Start time: 25-feb-2007
19:55:46<br>
&gt; End time: 26-feb-2007
13:49:03<br>
&gt; Elapsed time: 17 hours 53
mins 17 secs<br>
&gt; Priority: 10<br>
&gt; FD Files Written: 4,046,880<br>
&gt; SD Files Written: 4,046,880<br>
&gt; FD Bytes Written: 1,387,910,783,372 (1.387
TB)<br>
&gt; SD Bytes Written: 1,388,589,182,436 (1.388
TB)<br>
&gt; Rate:
21552.4 KB/s<br>
&gt; Software Compression: None<br>
&gt; VSS:
no<br>
&gt; Encryption: no<br>
&gt; Volume name(s): web-004|web-005|web-006<br>
&gt; Volume Session Id: 1<br>
&gt; Volume Session Time: 1172427565<br>
&gt; Last Volume Bytes: 266,951,559,168 (266.9
GB)<br>
&gt; Non-fatal FD errors: 0<br>
&gt; SD Errors: 0<br>
&gt; FD termination status: OK<br>
&gt; SD termination status: OK<br>
&gt; Termination: Backup
OK<br>
&gt; <br>
&gt; <br>
&gt; What speaks against a hardware or OS problem is that, with the same <br>
&gt; hardware and OS, Bacula 1.36.3 did not have this problem; it first <br>
&gt; appeared with 1.38.11.<br>
&gt; The device setup is quite simple:<br>
&gt; <br>
&gt; <br>
&gt; Device {<br>
&gt; Name = LTO1<br>
&gt; Media Type = LTO-3<br>
&gt; Archive Device = /dev/lto1<br>
&gt; AutomaticMount = yes;
# when device opened, read it<br>
&gt; AlwaysOpen = no;<br>
&gt; Autoselect = no<br>
&gt; RemovableMedia = yes;<br>
&gt; RandomAccess = no;<br>
&gt; Changer Command = "/etc/bacula/mtx-changer %c %o %S %a
%d"<br>
&gt; Changer Device = /dev/chg4<br>
&gt; Drive Index = 0<br>
&gt; AutoChanger = yes<br>
&gt; Alert Command = "sh -c 'mt -f %a status'"<br>
&gt; Maximum Network Buffer Size = 65536<br>
&gt; }<br>
&gt; <br>
&gt; Devices /dev/lto1 and /dev/chg4 are symlinks to real devices in order
to <br>
&gt; manage hardware configuration changes.<br>
&gt; <br>
&gt; Thanks again<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; *Arno Lehmann &lt;al < at > it...&gt;*<br>
&gt; Sent by: bacula-users-bounces < at > li...<br>
&gt; 26/02/2007 20.33<br>
&gt; <br>
&gt; To: bacula-users &lt;bacula-users < at > li...&gt;<br>
&gt; CC:<br>
&gt; Subject: Re: [Bacula-users] Change tape problem<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; Hello,<br>
&gt; <br>
&gt; On 2/26/2007 10:54 AM, Ferdinando Pasqualetti wrote:<br>
&gt; &gt;<br>
&gt; &gt; Hi Bacula users,<br>
&gt; &gt; sorry if you get this message twice; I first sent it from a wrong sender<br>
&gt; &gt; address (not subscribed to the list), so I am sending it again.<br>
&gt; &gt; I am facing a problem that first appeared with rev. 1.38.11 (I never saw it<br>
&gt; &gt; with 1.36.3). It did not happen every time, but very often. Now that<br>
&gt; &gt; I have switched to 2.0.2, it is much more frequent.<br>
&gt; &gt; The problem is that when a tape is full, Bacula correctly changes<br>
&gt; &gt; the tape in the autochanger drive but immediately afterwards gets this error:<br>
&gt; &gt;<br>
&gt; &gt; 25-feb 02:47 bacula-sd: End of Volume "web-004"
at 594:3362 on device<br>
&gt; &gt; "LTO1" (/dev/lto1). Write of 64512 bytes got
-1.<br>
&gt; &gt; 25-feb 02:47 bacula-sd: Re-read of last block succeeded.<br>
&gt; &gt; 25-feb 02:47 bacula-sd: End of medium on Volume "web-004"<br>
&gt; &gt; Bytes=594,178,937,856 Blocks=9,210,362 at 25-feb-2007 02:47.<br>
&gt; &gt; 25-feb 02:47 bacula-sd: 3301 Issuing autochanger "loaded?
drive 0" <br>
&gt; command.<br>
&gt; &gt; 25-feb 02:47 bacula-sd: 3302 Autochanger "loaded?
drive 0", result is<br>
&gt; &gt; Slot 3.<br>
&gt; &gt; 25-feb 02:47 bacula-sd: 3307 Issuing autochanger "unload
slot 3, drive<br>
&gt; &gt; 0" command.<br>
&gt; &gt; 25-feb 02:48 bacula-sd: 3304 Issuing autochanger "load
slot 4, drive 0"<br>
&gt; &gt; command.<br>
&gt; &gt; 25-feb 02:48 bacula-sd: 3305 Autochanger "load slot
4, drive 0", status<br>
&gt; &gt; is OK.<br>
&gt; &gt; 25-feb 02:48 bacula-sd: 3301 Issuing autochanger "loaded?
drive 0" <br>
&gt; command.<br>
&gt; &gt; 25-feb 02:48 bacula-sd: 3302 Autochanger "loaded?
drive 0", result is<br>
&gt; &gt; Slot 4.<br>
&gt; &gt; 25-feb 02:49 bacula-sd: Wrote label to prelabeled Volume
"web-005" on<br>
&gt; &gt; device "LTO1" (/dev/lto1)<br>
&gt; &gt; 25-feb 02:49 bacula-sd: New volume "web-005"
mounted on device "LTO1"<br>
&gt; &gt; (/dev/lto1) at 25-feb-2007 02:49.<br>
&gt; &gt; 25-feb 02:49 bacula-sd: End of Volume "web-005"
at 0:1 on device "LTO1"<br>
&gt; &gt; (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; &gt; 25-feb 02:49 bacula-sd: webfs3-job.2007-02-24_20.03.22
Error: Re-read of<br>
&gt; &gt; last block OK, but block numbers differ. Last block=0 Current
<br>
&gt; block=9210362.<br>
&gt; &gt; 25-feb 02:49 bacula-sd: Job write elapsed time = 06:43:26,
Transfer rate<br>
&gt; &gt; = 24.52 M bytes/second<br>
&gt; &gt; 25-feb 02:49 webfs3: webfs3-job.2007-02-24_20.03.22 Fatal
error:<br>
&gt; &gt; backup.c:860 Network send error to SD. ERR=Broken pipe<br>
&gt; &gt; 25-feb 02:49 bacula-dir: webfs3-job.2007-02-24_20.03.22
Error: Bacula<br>
&gt; &gt; 2.0.2 (28Jan07): 25-feb-2007 02:49:25<br>
&gt; &gt;<br>
&gt; &gt; It seems there are two problems. The first (and most important)<br>
&gt; &gt; one is that Bacula gets an end of volume on the new tape,<br>
&gt; <br>
&gt; What Bacula reports as an EOT can be caused by a drive error, too,
so<br>
&gt; for the time being I assume that the second error is tightly related
to<br>
&gt; this one.<br>
&gt; <br>
&gt; &gt; and the second<br>
&gt; &gt; one is the difference in the last block (it appears to
be the last block<br>
&gt; &gt; of the previous tape).<br>
&gt; <br>
&gt; If that's the case, and your description seems quite clear, you might<br>
&gt; have found an OS or hardware bug, too.<br>
&gt; <br>
&gt; This is only guesswork, but it could be possible that, after a tape<br>
&gt; change, the hardware or the tape driver don't update their state<br>
&gt; information.<br>
&gt; <br>
&gt; If that's the case, you could try the following:<br>
&gt; - first, have a look at your system log and dmesg output. There might
be<br>
&gt; errors reported there.<br>
&gt; - second, try to reproduce the problem without using Bacula. Unmount
the<br>
&gt; tape drive from bconsole. Load a tape (an unused one, or one with
write<br>
&gt; protection). If you use an empty tape, write some data and some file<br>
&gt; marks to it, ending with an EOT mark. dd and mt are tools for that
purpose.<br>
&gt; Then, use tapeinfo or st to observe the tape status, especially the block<br>
&gt; position reported, when doing some rewinds, fast forwards, offline,
and<br>
&gt; see what happens after you used mtx to unload and reload that tape.<br>
&gt; <br>
&gt; If there really is a problem with the hardware or the OS driver, you<br>
&gt; should be able to reproduce it then. Updating the drive firmware and
the<br>
&gt; OS (or, if that's up to date, filing a bug report) would be two options<br>
&gt; then.<br>
&gt; <br>
&gt; Otherwise, you should run btape again, because there are some things
in<br>
&gt; the report I don't like - errors writing the last block to tape should<br>
&gt; not happen with current hardware, for example. You might try to tune<br>
&gt; your device configuration, and perhaps you'll have to set the tape<br>
&gt; driver to a different write mode. Suggesting something is difficult<br>
&gt; without seeing how it's set up now :-)<br>
&gt; <br>
&gt; &gt; Bacula is the MySQL build on RedHat AS 4.04, built with rpmbuild on that system,<br>
&gt; &gt; an HP ProLiant G3 3.2 GHz with 2 GB RAM.<br>
&gt; &gt; The tape library is an MSL6000 with two LTO-3 drives, driven by Bacula directly<br>
&gt; &gt; (not using the autochanger as a device - 1.36.3 setup).<br>
&gt; &gt; Btape tests run correctly, including the "fill and
change tape" (I am<br>
&gt; &gt; attaching the test result, if someone is interested).<br>
&gt; &gt; Did anyone get a similar problem?<br>
&gt; <br>
&gt; That basic setup should run ok I think... nothing unusual there.<br>
&gt; <br>
&gt; Arno<br>
&gt; <br>
&gt; &gt;<br>
&gt; &gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; &gt; Ferdinando Pasqualetti<br>
&gt; &gt; G.T.Dati srl<br>
&gt; &gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; ------------------------------------------------------------------------<br>
&gt; &gt;<br>
&gt; &gt; -------------------------------------------------------------------------<br>
&gt; &gt; Take Surveys. Earn Cash. Influence the Future of IT<br>
&gt; &gt; Join SourceForge.net's Techsay panel and you'll get the
chance to <br>
&gt; share your<br>
&gt; &gt; opinions on IT &amp; business topics through brief surveys-and
earn cash<br>
&gt; &gt; http://www.techsay.com/default.php?page=join.php&amp;p=sourceforge&amp;CID=DEVDEV<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; ------------------------------------------------------------------------<br>
&gt; &gt;<br>
&gt; &gt; _______________________________________________<br>
&gt; &gt; Bacula-users mailing list<br>
&gt; &gt; Bacula-users < at > li...<br>
&gt; &gt; https://lists.sourceforge.net/lists/listinfo/bacula-users<br>
&gt; <br>
&gt; -- <br>
&gt; IT-Service Lehmann
al < at > it...<br>
&gt; Arno Lehmann
http://www.its-lehmann.de<br>
<br>
-- <br>
IT-Service Lehmann
al < at > it...<br>
Arno Lehmann http://www.its-lehmann.de<br>
</tt></font>
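Arno's suggested reproduction steps quoted above (write data and file marks, check the drive status, then unload and reload the tape with mtx) can be laid out as an ordered command list. The sketch below only builds and prints the commands; it does not run them. The device paths `/dev/nst0` and `/dev/sg3`, the slot number, and the block count are hypothetical placeholders, and the helper name `reproduction_commands` is made up for illustration. The sequence assumes a non-rewinding tape device and a scratch tape, since the `dd` step overwrites whatever is on it.

```python
from typing import List


def reproduction_commands(tape: str, changer: str, slot: int, drive: int = 0,
                          block_size: int = 64512,
                          blocks: int = 1000) -> List[List[str]]:
    """Return the argv lists for a manual tape-change test, in order.

    Nothing is executed here; the caller decides whether to run them.
    """
    status = ["mt", "-f", tape, "status"]
    return [
        ["mt", "-f", tape, "rewind"],
        # Write some data with the same block size Bacula reports (64512).
        ["dd", "if=/dev/zero", f"of={tape}",
         f"bs={block_size}", f"count={blocks}"],
        # Write an explicit file mark after the data.
        ["mt", "-f", tape, "weof", "1"],
        status,
        # Move around on the tape and look at the reported position again.
        ["mt", "-f", tape, "rewind"],
        ["mt", "-f", tape, "fsf", "1"],
        status,
        # Eject, then let the changer unload and reload the same slot.
        ["mt", "-f", tape, "offline"],
        ["mtx", "-f", changer, "unload", str(slot), str(drive)],
        ["mtx", "-f", changer, "load", str(slot), str(drive)],
        # After the reload the file/block numbers should start from zero.
        status,
    ]


if __name__ == "__main__":
    # Hypothetical device names; substitute the real ones before use.
    for argv in reproduction_commands("/dev/nst0", "/dev/sg3", slot=4):
        print(" ".join(argv))
```

If the status after the final reload still shows an end-of-data condition or a stale block position, that would point toward the drive or driver rather than Bacula.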

Post Rif: Re: Change tape problem 
<br><font size=2 face="sans-serif">Hi Kern,</font>
<br><font size=2 face="sans-serif">sorry to write to you directly. A few days
ago I sent the same email to the devel list but got no answer, so I am making
one last try. Please tell me if the problem is not worth investigating because no
one else has reported it, or if I have not described it clearly.</font>
<br>
<br><font size=2 face="sans-serif">I am following up on Arno's suggestion about
a problem writing to subsequent tapes that started with 1.38.11 and persists
in 2.0.2. Arno's idea is that the problem could lie in the tape hardware or the OS,
and he may be right, but the same hardware and OS did not show the
problem with 1.36.3 (the hardware could also have failed in the meantime, but
everything else appears to work).</font>
<br>
<br><font size=2 face="sans-serif">What I think happens is this: once writing
a tape block has returned ENOSPC, the last block is correctly re-read
and the autochanger swaps in a prelabelled tape. The
label is then rewritten (or skipped, I do not know), and when the pending block is written
to the new tape it gets the ENOSPC error again. This is again reported
as an EOT, but the last-block check fails and so the job fails too (this
is a consequential error anyway).</font>
<br>
<br><font size=2 face="sans-serif">I have also made a very small modification
to block.c (I am not a programmer, let alone
a C programmer, so this change is surely not the correct fix, even if it works around
the problem). The change is:</font>
<br>
<br><font size=2 face="sans-serif">532d531</font>
<br><font size=2 face="sans-serif">&lt; if (dev-&gt;file ==
0) { dev-&gt;clrerror(-1); }</font>
<br>
<br><font size=2 face="sans-serif">which simply means (I hope) "clear
all errors before writing the block if this is the first tape file".</font>
<br><font size=2 face="sans-serif">This is needed because the write keeps
returning the error until an EOF mark is written. Because a tape file is normally around
1 GB, throughput drops to 20-25% of normal during this phase, but I can
live with that.</font>
<br>
<br><font size=2 face="sans-serif">What I would like to know is whether,
in your opinion, this behaviour really is a hardware/firmware/OS problem,
and whether the block routine could be made more resilient in some way (writing an EOF
mark at the end of the tape, closing and reopening the fd, or whatever).</font>
<br>
<br><font size=2 face="sans-serif">Many thanks if you are able to give
me an answer, and many thanks in any case for this great package.</font>
<br>
<br><font size=2 face="sans-serif">Sincerely,</font>
<br>
<br><font size=2 face="sans-serif">--------------------------------------------------------------------------<br>
Ferdinando Pasqualetti<br>
G.T.Dati srl<br>
Tel. 0557310862 - 3356172731 - Fax 055720143<br>
<br>
<br>
</font>
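The hypothesis described above (an out-of-space condition that survives the tape change, and a workaround that clears errors while still in the first tape file) can be illustrated with a toy model. This is a sketch of the poster's hypothesis only: it is not Bacula code and does not claim to reproduce real drive or driver behaviour. The class `ToyDrive` and its fields are invented for illustration; only the name `clrerror` and the `file == 0` test mirror the one-line change quoted in the post.

```python
import errno


class ToyDrive:
    """Toy drive whose end-of-medium flag is NOT reset by a tape change."""

    def __init__(self, capacity_blocks):
        self.capacity = capacity_blocks
        self.used = 0            # blocks written on the current tape
        self.file = 0            # current tape file number
        self.sticky_eom = False  # the hypothesised stale error state

    def write_block(self):
        if self.sticky_eom or self.used >= self.capacity:
            self.sticky_eom = True
            raise OSError(errno.ENOSPC, "No space left on device")
        self.used += 1

    def change_tape(self):
        # A fresh tape is loaded, but the stale flag is deliberately kept.
        self.used = 0
        self.file = 0

    def clrerror(self):
        self.sticky_eom = False


def write_blocks(drive, nblocks, workaround):
    """Write nblocks; optionally clear errors first while in tape file 0."""
    written = 0
    for _ in range(nblocks):
        if workaround and drive.file == 0:
            drive.clrerror()
        drive.write_block()
        written += 1
    return written


def demo(workaround):
    """Fill one tape, change it, and report blocks written on the new tape."""
    drive = ToyDrive(capacity_blocks=3)
    try:
        write_blocks(drive, 5, workaround)
    except OSError as exc:
        assert exc.errno == errno.ENOSPC  # the first tape really is full
    drive.change_tape()
    try:
        return write_blocks(drive, 2, workaround)
    except OSError:
        return 0


if __name__ == "__main__":
    print("without workaround:", demo(False))
    print("with workaround:   ", demo(True))
```

In this model the new tape accepts no blocks unless the error is cleared first, which matches the symptom in the log (end of volume at 0:1 on the freshly loaded tape). The model says nothing about where a real stale state would live (drive firmware, driver, or the storage daemon's own device state), which is exactly the open question in this thread.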
<br><font size=2><tt>Hi,<br>
<br>
On 3/11/2007 6:33 PM, Ferdinando Pasqualetti wrote:<br>
&gt; <br>
&gt; Hi Arno,<br>
&gt; I made some tests and this is what I think.<br>
&gt; When there is a tape change after an out-of-space error, subsequent block <br>
&gt; writes continue to get that error even after the robot has changed the <br>
&gt; tape. This continues.<br>
&gt; I made some changes to the block.c routine<br>
<br>
You'd better discuss this at bacula-devel, I think, or send Kern a mail
<br>
explaining the problem and the resolution.<br>
<br>
&gt; (very simple, because I'm not <br>
&gt; a C programmer and I also don't know the logic of the SD program). I made <br>
&gt; the routine enter the retry loop even for ENOSPC if the file number is 0. <br>
&gt; This made bacula-sd work correctly (but it took 20 hours to write file <br>
&gt; 0). After the EOF mark is written, the speed is normal again.<br>
&gt; My idea is that changing the tape does not reset the EOD condition on <br>
&gt; the tape<br>
<br>
That sounds like a bug, either in the hardware or the HBA driver.<br>
<br>
&gt; until a file mark is written. I do not know whether this is a <br>
&gt; device or an OS fault, but I believe the tape's file descriptor should be closed <br>
&gt; and reopened on a tape change.<br>
<br>
I know next to nothing about these details, so I won't comment on it...<br>
<br>
&gt; dd and mt tests always gave correct results, but dd always writes an EOF <br>
&gt; mark at the end of the transfer.<br>
&gt; <br>
&gt; If you have any ideas about this I will be very happy. Thank you very <br>
&gt; much in any case.<br>
<br>
Difficult problem, I think.<br>
<br>
If this is a hardware or driver problem, I don't think modifying the SD
<br>
code is the right solution.<br>
<br>
If it works for you - fine, but it might be that you have to manage that
<br>
patch for your installation yourself.<br>
<br>
Arno<br>
<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; *Ferdinando Pasqualetti/San Lazzaro/Conserve Italia*<br>
&gt; <br>
&gt; 27/02/2007 09.47<br>
&gt; <br>
&gt; To: Arno Lehmann &lt;al < at > it...&gt;<br>
&gt; CC: bacula-users &lt;bacula-users < at > li...&gt;<br>
&gt; Subject: Rif: Re: [Bacula-users] Change tape problem<br>
&gt; <br>
&gt; Hi Arno,<br>
&gt; thank you very much for your answer. I will run the tests you suggest <br>
&gt; as soon as possible. In the meantime, I purged the volumes involved in the error shown <br>
&gt; in the original message (it was the third attempt), restarted the backup job, <br>
&gt; and here is the (correct) result.<br>
&gt; <br>
&gt; 25-feb 19:55 bacula-dir: Start Backup JobId 12927, <br>
&gt; Job=webfs3-job.2007-02-25_19.55.40<br>
&gt; 25-feb 19:55 bacula-dir: Recycled volume "web-004"<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: run command "/root/restartsmb"<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: Shutting down SMB services:
[ <br>
&gt; OK ]<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: smbd: nessun processo terminato<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: Starting SMB services: [
OK ]<br>
&gt; 25-feb 19:55 webfs3: ClientRunBeforeJob: [ OK ]<br>
&gt; 25-feb 19:55 bacula-sd: 3307 Issuing autochanger "unload slot
7, drive <br>
&gt; 0" command.<br>
&gt; 25-feb 19:57 bacula-sd: 3304 Issuing autochanger "load slot 3,
drive 0" <br>
&gt; command.<br>
&gt; 25-feb 19:57 bacula-sd: 3305 Autochanger "load slot 3, drive
0", status <br>
&gt; is OK.<br>
&gt; 25-feb 19:57 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 25-feb 19:57 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 25-feb 19:57 bacula-sd: Recycled volume "web-004" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; webfs3: /proc is a different filesystem. Will
not descend from / <br>
&gt; into /proc<br>
&gt; webfs3: /boot is a different filesystem. Will
not descend from / <br>
&gt; into /boot<br>
&gt; webfs3: /dev is a different filesystem. Will not
descend from / <br>
&gt; into /dev<br>
&gt; webfs3: /var/lib/nfs/rpc_pipefs is a different
filesystem. Will not <br>
&gt; descend from / into /var/lib/nfs/rpc_pipefs<br>
&gt; webfs3: /sys is a different filesystem. Will not
descend from / <br>
&gt; into /sys<br>
&gt; webfs3: /uno is a different filesystem. Will not
descend from / <br>
&gt; into /uno<br>
&gt; 26-feb 04:14 bacula-sd: End of Volume "web-004" at 594:6519
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 26-feb 04:14 bacula-sd: Re-read of last block succeeded.<br>
&gt; 26-feb 04:14 bacula-sd: End of medium on Volume "web-004"
<br>
&gt; Bytes=594,382,602,240 Blocks=9,213,519 at 26-feb-2007 04:14.<br>
&gt; 26-feb 04:14 bacula-dir: Recycled volume "web-005"<br>
&gt; 26-feb 04:14 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 04:14 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 3.<br>
&gt; 26-feb 04:14 bacula-sd: 3307 Issuing autochanger "unload slot
3, drive <br>
&gt; 0" command.<br>
&gt; 26-feb 04:15 bacula-sd: 3304 Issuing autochanger "load slot 4,
drive 0" <br>
&gt; command.<br>
&gt; 26-feb 04:15 bacula-sd: 3305 Autochanger "load slot 4, drive
0", status <br>
&gt; is OK.<br>
&gt; 26-feb 04:15 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 04:15 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 26-feb 04:15 bacula-sd: Recycled volume "web-005" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; 26-feb 04:15 bacula-sd: New volume "web-005" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 26-feb-2007 04:15.<br>
&gt; 26-feb 10:21 bacula-sd: End of Volume "web-005" at 528:6656
on device <br>
&gt; "LTO1" (/dev/lto1). Write of 64512 bytes got -1.<br>
&gt; 26-feb 10:21 bacula-sd: Re-read of last block succeeded.<br>
&gt; 26-feb 10:21 bacula-sd: End of medium on Volume "web-005"
<br>
&gt; Bytes=528,395,664,384 Blocks=8,190,656 at 26-feb-2007 10:21.<br>
&gt; 26-feb 10:21 bacula-dir: Recycled volume "web-006"<br>
&gt; 26-feb 10:21 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 10:21 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 4.<br>
&gt; 26-feb 10:21 bacula-sd: 3307 Issuing autochanger "unload slot
4, drive <br>
&gt; 0" command.<br>
&gt; 26-feb 10:22 bacula-sd: 3304 Issuing autochanger "load slot 5,
drive 0" <br>
&gt; command.<br>
&gt; 26-feb 10:22 bacula-sd: 3305 Autochanger "load slot 5, drive
0", status <br>
&gt; is OK.<br>
&gt; 26-feb 10:22 bacula-sd: 3301 Issuing autochanger "loaded? drive
0" command.<br>
&gt; 26-feb 10:22 bacula-sd: 3302 Autochanger "loaded? drive 0",
result is <br>
&gt; Slot 5.<br>
&gt; 26-feb 10:23 bacula-sd: Recycled volume "web-006" on device
"LTO1" <br>
&gt; (/dev/lto1), all previous data lost.<br>
&gt; 26-feb 10:23 bacula-sd: New volume "web-006" mounted on
device "LTO1" <br>
&gt; (/dev/lto1) at 26-feb-2007 10:23.<br>
&gt; 26-feb 13:49 bacula-sd: Job write elapsed time = 17:48:45, Transfer
rate <br>
&gt; = 21.65 M bytes/second<br>
&gt; 26-feb 13:49 bacula-sd: Alert: SCSI 2 tape drive:<br>
&gt; 26-feb 13:49 bacula-sd: Alert: File number=267, block number=0, partition=0.<br>
&gt; 26-feb 13:49 bacula-sd: Alert: Tape block size 0 bytes. Density code
<br>
&gt; 0x44 (no translation).<br>
&gt; 26-feb 13:49 bacula-sd: Alert: Soft error count since last status=0<br>
&gt; 26-feb 13:49 bacula-sd: Alert: General status bits on (81010000):<br>
&gt; 26-feb 13:49 bacula-sd: Alert: EOF ONLINE IM_REP_EN<br>
&gt; 26-feb 13:49 bacula-dir: Bacula 2.0.2 (28Jan07): 26-feb-2007 13:49:03<br>
&gt; JobId: 12927<br>
&gt; Job: webfs3-job.2007-02-25_19.55.40<br>
&gt; Backup Level: Full<br>
&gt; Client: "webfs3" 2.0.2 (28Jan07) i686-redhat-linux-gnu,redhat,Enterprise release<br>
&gt; FileSet: "webfs3-fileset" 2005-04-30 07:13:53<br>
&gt; Pool: "webfs" (From Job resource)<br>
&gt; Storage: "LTO-1" (From user selection)<br>
&gt; Scheduled time: 25-feb-2007 19:55:17<br>
&gt; Start time: 25-feb-2007 19:55:46<br>
&gt; End time: 26-feb-2007 13:49:03<br>
&gt; Elapsed time: 17 hours 53 mins 17 secs<br>
&gt; Priority: 10<br>
&gt; FD Files Written: 4,046,880<br>
&gt; SD Files Written: 4,046,880<br>
&gt; FD Bytes Written: 1,387,910,783,372 (1.387 TB)<br>
&gt; SD Bytes Written: 1,388,589,182,436 (1.388 TB)<br>
&gt; Rate: 21552.4 KB/s<br>
&gt; Software Compression: None<br>
&gt; VSS: no<br>
&gt; Encryption: no<br>
&gt; Volume name(s): web-004|web-005|web-006<br>
&gt; Volume Session Id: 1<br>
&gt; Volume Session Time: 1172427565<br>
&gt; Last Volume Bytes: 266,951,559,168 (266.9 GB)<br>
&gt; Non-fatal FD errors: 0<br>
&gt; SD Errors: 0<br>
&gt; FD termination status: OK<br>
&gt; SD termination status: OK<br>
&gt; Termination: Backup OK<br>
&gt; <br>
&gt; <br>
&gt; What speaks against a hardware or OS problem is that <br>
&gt; Bacula 1.36.3 did not have this problem on the same hardware and OS; it <br>
&gt; first arose with 1.38.11.<br>
&gt; The device setup is quite simple:<br>
&gt; <br>
&gt; <br>
&gt; Device {<br>
&gt; Name = LTO1<br>
&gt; Media Type = LTO-3<br>
&gt; Archive Device = /dev/lto1<br>
&gt; AutomaticMount = yes; # when device opened, read it<br>
&gt; AlwaysOpen = no;<br>
&gt; Autoselect = no<br>
&gt; RemovableMedia = yes;<br>
&gt; RandomAccess = no;<br>
&gt; Changer Command = "/etc/bacula/mtx-changer %c %o %S %a %d"<br>
&gt; Changer Device = /dev/chg4<br>
&gt; Drive Index = 0<br>
&gt; AutoChanger = yes<br>
&gt; Alert Command = "sh -c 'mt -f %a status'"<br>
&gt; Maximum Network Buffer Size = 65536<br>
&gt; }<br>
&gt; <br>
&gt; Devices /dev/lto1 and /dev/chg4 are symlinks to real devices in order
to <br>
&gt; manage hardware configuration changes.<br>
&gt; <br>
&gt; Thanks again<br>
&gt; <br>
&gt; --------------------------------------------------------------------------<br>
&gt; Ferdinando Pasqualetti<br>
&gt; G.T.Dati srl<br>
&gt; Tel. 0557310862 - 3356172731 - Fax 055720143<br>
</tt></font>
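The throughput figures in the successful job report (JobId 12927) quoted in this thread can be cross-checked from the byte counts and elapsed times it prints. The short calculation below assumes decimal units (1 MB = 10^6 bytes, 1 KB = 10^3 bytes); that is an assumption, but it reproduces both reported rates.

```python
def hms_to_seconds(hours, minutes, seconds):
    """Convert an h:m:s duration to seconds."""
    return hours * 3600 + minutes * 60 + seconds


# Values copied from the job report.
sd_bytes = 1_388_589_182_436               # SD Bytes Written
fd_bytes = 1_387_910_783_372               # FD Bytes Written
sd_elapsed = hms_to_seconds(17, 48, 45)    # "Job write elapsed time"
job_elapsed = hms_to_seconds(17, 53, 17)   # "Elapsed time"

# Storage daemon rate, reported as "21.65 M bytes/second".
sd_rate_mb = sd_bytes / sd_elapsed / 1e6

# Director summary rate, reported as "21552.4 KB/s".
job_rate_kb = fd_bytes / job_elapsed / 1e3

if __name__ == "__main__":
    print(f"SD rate:  {sd_rate_mb:.2f} MB/s")
    print(f"Job rate: {job_rate_kb:.1f} KB/s")
```

Both numbers agree with the log, so the two rates differ only because they use different byte counts and different elapsed times, not because of any inconsistency in the run.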
