SearchFAQMemberlist Log in
Reply to topic Page 1 of 1
network error job aborted
Author Message
Post network error job aborted 
Hello,
I've got a bacula win32 2.03 client that has been backing up fine. This
morning it didn't and gave me a fatal network error. I'm open to
suggestions, i've confirmed that all the daemons are still running. Here's
an excerpt from my bacula log.

18-Jun 04:38 titan-fd: Generate VSS snapshots. Driver="VSS WinXP",
Drive(s)="C"
18-Jun 04:40 zeus-dir: backup_titan.2007-06-18_04.35.00 Fatal error: Network
error with FD during Backup: ERR=Connection reset by peer
18-Jun 04:40 zeus-sd: backup_titan.2007-06-18_04.35.00 Fatal error:
append.c:259 Network error on data channel. ERR=Connection reset by peer
18-Jun 04:40 zeus-sd: Job write elapsed time = 00:05:48, Transfer rate =
615.2 K bytes/second
18-Jun 04:40 zeus-dir: backup_titan.2007-06-18_04.35.00 Fatal error: No Job
status returned from FD.

Thanks.
Dave.

Post network error job aborted 
what does:

"status client=backup_titan" show?

Did you restart the bacula-fd service on the client?

Check the windows event viewer?

Re-start the bacula-fd.exe process from command line manually with -d99
-f -v ?

~BAS

On Mon, 2007-06-18 at 07:41 -0400, Dave wrote:
zeus-dir
--
Brian A. Seklecki <bseklecki < at > co...>
Collaborative Fusion, Inc.

IMPORTANT: This message contains confidential information and is intended only for the individual named. If the reader of this message is not an intended recipient (or the individual responsible for the delivery of this message to an intended recipient), please be advised that any re-use, dissemination, distribution or copying of this message is prohibited. Please notify the sender immediately by e-mail if you have received this e-mail by mistake and delete this e-mail from your system.

Post network error job aborted 
Dave wrote:
Hello,
I've got a bacula win32 2.03 client that has been backing up fine. This
morning it didn't and gave me a fatal network error. I'm open to
suggestions, i've confirmed that all the daemons are still running. Here's
an excerpt from my bacula log.

18-Jun 04:38 titan-fd: Generate VSS snapshots. Driver="VSS WinXP",
Drive(s)="C"
18-Jun 04:40 zeus-dir: backup_titan.2007-06-18_04.35.00 Fatal error: Network
error with FD during Backup: ERR=Connection reset by peer
18-Jun 04:40 zeus-sd: backup_titan.2007-06-18_04.35.00 Fatal error:
append.c:259 Network error on data channel. ERR=Connection reset by peer
18-Jun 04:40 zeus-sd: Job write elapsed time = 00:05:48, Transfer rate =
615.2 K bytes/second
18-Jun 04:40 zeus-dir: backup_titan.2007-06-18_04.35.00 Fatal error: No Job
status returned from FD.

I actually just had this same problem start up a few weeks ago! I haven't
found a solution yet, but I can at least give you some suggestions.

http://thread.gmane.org/gmane.comp.sysutils.backup.bacula.general/35287

The first thing you want to do, is check for any network related problems.
Errors on port, duplex mismatches, firewalls in between, that sort of stuff.
You can also try enabling the heartbeat options. If it is a problem with a
device in between screwing up a state table, the heartbeat option should help.

Also, if you can grab a tcpdump (headers only) of the problem occurring, I'd
love to compare it against the one I had (the url is in that thread if you'd
like to take a look).

Whether you find a solution, or end up filing a bug report, please let me
know, as I'm still having the problem myself...

--
Frank Sweetser fs at wpi.edu | For every problem, there is a solution that
WPI Senior Network Engineer | is simple, elegant, and wrong. - HL Mencken
GPG fingerprint = 6174 1257 129E 0D21 D8D4 E8A3 8E39 29E3 E2E8 8CEC

Post network error job aborted 
Hello,
I really hate it when i try to respond to questions and when i do bacula
does not duplicate the error.
Brian, a status on the client showed that it never went down, i
restarted it manually with the flags you indicated and nothing more verbose
came through. Like Frank i'm also not convinced it's networking, this box
and the server do have firewalls, but they pass bacula traffic without
issues.
All of the daemons involved are 2.03, i'm not using any 2.1.x bacula on
this client.
Windows event viewer didn't tell me anything, i saw two errors around
the time in question, but i had a user on the box unbenounced to me playing
an online game, error came from one of the dlls it used. My first
speculation is that error that snafu whatever corrupted the vss snapshot, is
this possible or how could i confirm this?
The other thing that it might be although i doubt this as well is a
conflict between running jobs. The job in question started at 4:35, and died
at 5:47. I had another job, backing up to another disk volume totally
separate, kick off two minutes before the failure at 5:45. That client was a
Unix not windows box. It worked fine, but i don't like not knowing the why,
i suspect i will see more of this from this client.
Thanks.
Dave.

----- Original Message -----
From: "Brian A. Seklecki" <bseklecki < at > co...>
To: "Dave" <dmehler26 < at > wo...>
Cc: <bacula-users < at > li...>
Sent: Monday, June 18, 2007 8:06 AM
Subject: Re: [Bacula-users] network error job aborted


what does:

"status client=backup_titan" show?

Did you restart the bacula-fd service on the client?

Check the windows event viewer?

Re-start the bacula-fd.exe process from command line manually with -d99
-f -v ?

~BAS

On Mon, 2007-06-18 at 07:41 -0400, Dave wrote:
zeus-dir
--
Brian A. Seklecki <bseklecki < at > co...>
Collaborative Fusion, Inc.




IMPORTANT: This message contains confidential information and is intended
only for the individual named. If the reader of this message is not an
intended recipient (or the individual responsible for the delivery of this
message to an intended recipient), please be advised that any re-use,
dissemination, distribution or copying of this message is prohibited.
Please notify the sender immediately by e-mail if you have received this
e-mail by mistake and delete this e-mail from your system.


Post network error job aborted 
Could be a stateful packet filter (firewall) timeout. Try setting your
TCP socket timeout to an hour or so. But which socket?

In cisco it's:

# timeout conn 12:00:00 half-closed 0:10:00 udp 0:03:00 icmp 0:00:02

In OpenBSD it's:

set timeout tcp.established 86400

Someone should document that.

~BAS

FD->SD? DIR->FD? Your error indicated that either that DIR lost
connection to the FD, or the FD lost connection to the SD.

On Mon, 2007-06-18 at 12:49 -0400, Dave wrote:
Hello,
I really hate it when i try to respond to questions and when i do bacula
does not duplicate the error.
Brian, a status on the client showed that it never went down, i
restarted it manually with the flags you indicated and nothing more verbose
came through. Like Frank i'm also not convinced it's networking, this box
and the server do have firewalls, but they pass bacula traffic without
issues.
All of the daemons involved are 2.03, i'm not using any 2.1.x bacula on
this client.
Windows event viewer didn't tell me anything, i saw two errors around
the time in question, but i had a user on the box unbenounced to me playing
an online game, error came from one of the dlls it used. My first
speculation is that error that snafu whatever corrupted the vss snapshot, is
this possible or how could i confirm this?
The other thing that it might be although i doubt this as well is a
conflict between running jobs. The job in question started at 4:35, and died
at 5:47. I had another job, backing up to another disk volume totally
separate, kick off two minutes before the failure at 5:45. That client was a
Unix not windows box. It worked fine, but i don't like not knowing the why,
i suspect i will see more of this from this client.
Thanks.
Dave.

----- Original Message -----
From: "Brian A. Seklecki" <bseklecki < at > co...>
To: "Dave" <dmehler26 < at > wo...>
Cc: <bacula-users < at > li...>
Sent: Monday, June 18, 2007 8:06 AM
Subject: Re: [Bacula-users] network error job aborted



what does:

"status client=backup_titan" show?

Did you restart the bacula-fd service on the client?

Check the windows event viewer?

Re-start the bacula-fd.exe process from command line manually with -d99
-f -v ?

~BAS

On Mon, 2007-06-18 at 07:41 -0400, Dave wrote:
zeus-dir
--
Brian A. Seklecki <bseklecki < at > co...>
Collaborative Fusion, Inc.




IMPORTANT: This message contains confidential information and is intended
only for the individual named. If the reader of this message is not an
intended recipient (or the individual responsible for the delivery of this
message to an intended recipient), please be advised that any re-use,
dissemination, distribution or copying of this message is prohibited.
Please notify the sender immediately by e-mail if you have received this
e-mail by mistake and delete this e-mail from your system.








--
Brian A. Seklecki <bseklecki < at > co...>
Collaborative Fusion, Inc.

IMPORTANT: This message contains confidential information and is intended only for the individual named. If the reader of this message is not an intended recipient (or the individual responsible for the delivery of this message to an intended recipient), please be advised that any re-use, dissemination, distribution or copying of this message is prohibited. Please notify the sender immediately by e-mail if you have received this e-mail by mistake and delete this e-mail from your system.

Display posts from previous:
Reply to topic Page 1 of 1
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
  


Magic SEO URL for phpBB