SearchFAQMemberlist Log in
Reply to topic Page 1 of 1
Having some trouble using amanada 2.6.1p2 on OpenSolaris
Author Message
Post Having some trouble using amanada 2.6.1p2 on OpenSolaris 
Hi list.

We're trying backup snapshots of some zfs filesystems on a OpenSolaris host.
Client is running amanda 2.6.1p2, server is running 2.5.3 (with some kerberos
fixes added) on a Solaris 10.

Backup of this client is via ssh and keys. The client is not build with kerberos.
Backup is with gnutar (/Usr/sfw/bin/gtar).
ssh is Solaris ssh.
No firewalls are involved.


However the client fails o communicate with the server. It seems to calculate the size right.
This is from the amandad.debug file:


OPTIONS features=ffffffff9ffeffffffff7f;
/.zfs/snapshot/daily.0 1 SIZE 39529810
/.zfs/snapshot/daily.0 2 SIZE 39529810
/opt/.zfs/snapshot/daily.0 1 SIZE 3968010
/opt/.zfs/snapshot/daily.0 2 SIZE 3968010
/sandbox/.zfs/snapshot/daily.0 1 SIZE 10
/sandbox/.zfs/snapshot/daily.0 2 SIZE 10
/sandbox/klas/.zfs/snapshot/daily.0 1 SIZE 71922300
/sandbox/klas/.zfs/snapshot/daily.0 2 SIZE 71922300
/sandbox/klas2/.zfs/snapshot/daily.0 1 SIZE 38611950
1289263081.214946: amandad: security_stream_seterr(8074e80, write error to: Bad file number)
1289263081.214961: amandad: security_seterror(handle=8074d30, driver=fef3a688 (SSH) error=write error to: Bad file number)
1289263137.443175: amandad: sending PREP pkt:
<<<<<


(I notice that last fs is not repeated like the other ones are.)

The server will log:


client /sandbox/klas2/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas2/.zfs/snapshot/daily.0, all estimate timed out]
client
/sandbox/klas/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas/.zfs/snapshot/daily.0, all estimate timed out]
client
/sandbox/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/.zfs/snapshot/daily.0, all estimate timed out]
client
/opt/.zfs/snapshot/daily.0 lev 0 FAILED [disk /opt/.zfs/snapshot/daily.0, all estimate timed out]
client /.zfs/snapshot/daily.0 lev 0 FAILED [disk /.zfs/snapshot/daily.0, all estimate timed out]


I guess something is broken in the ssh communication between server and client.

Has anyone seen anything like this before and have some idea about what is going wrong?
Could it be real timeouts when calculations take more time then expected?



Klas Heggemann
sys adm at csc kth .se

Post Having some trouble using amanada 2.6.1p2 on OpenSolaris 
What's the time difference between the first and last line in the
amandad.debug file?
Is it bigger than your etimeout setting? If so, increase etimeout

Jean-Louis

Klas Heggemann wrote:
Hi list.

We're trying backup snapshots of some zfs filesystems on a OpenSolaris host.
Client is running amanda 2.6.1p2, server is running 2.5.3 (with some kerberos
fixes added) on a Solaris 10.

Backup of this client is via ssh and keys. The client is not build with kerberos.
Backup is with gnutar (/Usr/sfw/bin/gtar).
ssh is Solaris ssh.
No firewalls are involved.


However the client fails o communicate with the server. It seems to calculate the size right.
This is from the amandad.debug file:


OPTIONS features=ffffffff9ffeffffffff7f;
/.zfs/snapshot/daily.0 1 SIZE 39529810
/.zfs/snapshot/daily.0 2 SIZE 39529810
/opt/.zfs/snapshot/daily.0 1 SIZE 3968010
/opt/.zfs/snapshot/daily.0 2 SIZE 3968010
/sandbox/.zfs/snapshot/daily.0 1 SIZE 10
/sandbox/.zfs/snapshot/daily.0 2 SIZE 10
/sandbox/klas/.zfs/snapshot/daily.0 1 SIZE 71922300
/sandbox/klas/.zfs/snapshot/daily.0 2 SIZE 71922300
/sandbox/klas2/.zfs/snapshot/daily.0 1 SIZE 38611950

1289263081.214946: amandad: security_stream_seterr(8074e80, write error to: Bad file number)
1289263081.214961: amandad: security_seterror(handle=8074d30, driver=fef3a688 (SSH) error=write error to: Bad file number)
1289263137.443175: amandad: sending PREP pkt:
<<<<<


(I notice that last fs is not repeated like the other ones are.)

The server will log:


client /sandbox/klas2/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas2/.zfs/snapshot/daily.0, all estimate timed out]
client
/sandbox/klas/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas/.zfs/snapshot/daily.0, all estimate timed out]
client
/sandbox/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/.zfs/snapshot/daily.0, all estimate timed out]
client
/opt/.zfs/snapshot/daily.0 lev 0 FAILED [disk /opt/.zfs/snapshot/daily.0, all estimate timed out]
client /.zfs/snapshot/daily.0 lev 0 FAILED [disk /.zfs/snapshot/daily.0, all estimate timed out]


I guess something is broken in the ssh communication between server and client.

Has anyone seen anything like this before and have some idea about what is going wrong?
Could it be real timeouts when calculations take more time then expected?



Klas Heggemann
sys adm at csc kth .se





Post Having some trouble using amanada 2.6.1p2 on OpenSolaris 
9 nov 2010 kl. 12.33 skrev Jean-Louis Martineau:

What's the time difference between the first and last line in the amandad.debug file?
Is it bigger than your etimeout setting? If so, increase etimeout

Tnx.

Format seems to be
seconds.milliseconds message
and diff between last and first line is 6522 seconds, which seems like a too long time to wait for a estimate value.

How do I dryrun this, i e running a gtar with the same flags, to compute the size of a backup?

/Klas




Jean-Louis

Klas Heggemann wrote:
Hi list.

We're trying backup snapshots of some zfs filesystems on a OpenSolaris host. Client is running amanda 2.6.1p2, server is running 2.5.3 (with some kerberos fixes added) on a Solaris 10.

Backup of this client is via ssh and keys. The client is not build with kerberos.
Backup is with gnutar (/Usr/sfw/bin/gtar). ssh is Solaris ssh. No firewalls are involved.


However the client fails o communicate with the server. It seems to calculate the size right.
This is from the amandad.debug file:

OPTIONS features=ffffffff9ffeffffffff7f;
/.zfs/snapshot/daily.0 1 SIZE 39529810
/.zfs/snapshot/daily.0 2 SIZE 39529810
/opt/.zfs/snapshot/daily.0 1 SIZE 3968010
/opt/.zfs/snapshot/daily.0 2 SIZE 3968010
/sandbox/.zfs/snapshot/daily.0 1 SIZE 10
/sandbox/.zfs/snapshot/daily.0 2 SIZE 10
/sandbox/klas/.zfs/snapshot/daily.0 1 SIZE 71922300
/sandbox/klas/.zfs/snapshot/daily.0 2 SIZE 71922300
/sandbox/klas2/.zfs/snapshot/daily.0 1 SIZE 38611950
1289263081.214946: amandad: security_stream_seterr(8074e80, write error to: Bad file number)
1289263081.214961: amandad: security_seterror(handle=8074d30, driver=fef3a688 (SSH) error=write error to: Bad file number)
1289263137.443175: amandad: sending PREP pkt:
<<<<<


(I notice that last fs is not repeated like the other ones are.)
The server will log:


client /sandbox/klas2/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas2/.zfs/snapshot/daily.0, all estimate timed out]
client /sandbox/klas/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/klas/.zfs/snapshot/daily.0, all estimate timed out]
client /sandbox/.zfs/snapshot/daily.0 lev 0 FAILED [disk /sandbox/.zfs/snapshot/daily.0, all estimate timed out]
client /opt/.zfs/snapshot/daily.0 lev 0 FAILED [disk /opt/.zfs/snapshot/daily.0, all estimate timed out]
client /.zfs/snapshot/daily.0 lev 0 FAILED [disk /.zfs/snapshot/daily.0, all estimate timed out]


I guess something is broken in the ssh communication between server and client.
Has anyone seen anything like this before and have some idea about what is going wrong? Could it be real timeouts when calculations take more time then expected?



Klas Heggemann
sys adm at csc kth .se






Display posts from previous:
Reply to topic Page 1 of 1
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
  


Magic SEO URL for phpBB