SearchFAQMemberlist Log in
Reply to topic Page 2 of 2
Goto page Previous  1, 2
Some info on my experiences with 10GbE
Author Message
Post Some info on my experiences with 10GbE 
in my mind, using 2 Gb/s or 4 Gb/s shouldn't make a bit of difference
for a drive that natively writes at 80 MB/s

It shouldn't make a difference if you're sending uncompressible data to
it. If you send highly compressible data to the drive, then there are 2
places on the drive (that I can think of) that could be bottlenecks -
the FC interface and the compression ASIC:


[ Server ] [ Tape Drive
]
[2Gb FC HBA]----------[2Gb FC]--[Compression ASIC]--[Write
Head]--[Physical Tape]


If the 2Gb FC interface is receiving data at 170MB/s, and the
compression ASIC does 4:1 compression, then the write head will be
sending this compressed data to tape at a speed of 42.5MB/s.

With the 4Gb drives, I'm still not able to push the write head to it's
native max speed of 80MB/s: I'm getting 265MB/s with, with 4:1
compression, so the native write speed is around 66MB/s.

-devon


-----Original Message-----
From: Nick Majeran [mailto:nmajeran < at > gmail.com]
Sent: Friday, October 19, 2007 9:02 AM
To: veritas-bu < at > mailman.eng.auburn.edu; Peters, Devon C
Subject: Re: [Veritas-bu] Some info on my experiences with 10GbE

Regarding the tape drives and compression -- this is the part that
confuses me.

I can max-out an LTO-3 drive at native write speed at 80MB/s with no
problem using pre-compressed data (compressed Sybase dbdumps), even
with a measly 64kb block size. This is using direct NDMP with 2 Gb/s
fc IBM LTO-3 drives.

Using contrived data, i.e. large files dd'ed from /dev/zero or
hpcreatedata, I have in the past maxed out 2 Gb/s LTO-3 drives at
approximately 170 MB/s, as you claim above. However, this was using
256kb block sizes. I have read reports where 2 Gb/s LTO-3 drives can
be pushed to 220-230 MB/s using the maximum block size supported by
LTO-3 (2 MB) and contrived data.

Now, if compression is done at the drive, I would think that with a 2
Gb/s interface, it should be able to receive data at roughly 170 MB/s,
but since the drive natively spins at 80 MB/s, it would compress that
data, 4x, as you claim, to get that 240 MB/s top end. But, in my
mind, using 2 Gb/s or 4 Gb/s shouldn't make a bit of difference for a
drive that natively writes at 80 MB/s.

Does anyone else have experience with this?

Also, I've seen LTO-3 tapes in our environment marked as "FULL" by
Netbackup with close to 2 TB of data on them.

-- nick



Yep, I'm using jumbo frames. The performance was around 50% lower
without it. I'm not currently using any switches for 10GbE, the servers
are connected directly together.

Re 4Gb vs 2Gb tape drives - since the data is compressed at the drive,
we still need to be able to transfer the data to the drives as fast as
possible. The highest throughput we've been able to get with a single
2Gb fibre HBA is about 190MB/s (using multiple 2Gb disk-subsystem ports
zoned to a single HBA port). The highest throughput we've gotten with a
single 2Gb tape drive is 170MB/s. Since this is near the peak we can
get with 2Gb, I assume that the 2Gb interface on the tape drive is
what's limiting our throughput.

Also, we get about 4x compression of this data on the tapes (~1600MB on
an LTO3 tape). So, with 265MB/s at 4x compression, the physical write
speed of the drive is probably somewhere around 65MB/s (265/4). Since
the tape compression ratio has remained the same with both 2Gb and 4Gb
drives, I'd guess that the physical drive speeds with the 2Gb drives
were probably closer to 40MB/s (170/4)...

-devon

_______________________________________________
Veritas-bu maillist - Veritas-bu < at > mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu

View user's profile Send private message
Post Re: Some info on my experiences with 10GbE 
I just started testing 2 T2000's with dual 10Gbps SUN Nics directly connected to each other...

I'm somewhat pissed that I'm only able to get about 658Mbps from one thread on the 10Gbps nic while I'm able to get 938Mbps using the onboard 1Gbs nic with when using the iperf default values. Which means in some cases the 10Gbit nic is actually slower than the onboard 1Gbit nic

However, I was able to get 7.2 Gbps using 10 threads.


Here are some of my max results with different thread (-P) values

./iperf -c 192.168.1.2 -f m -w 512K -l 512 -P x


TCP Win Buffer Threads Gbps
512 512 1 1.4
512 512 2 2.5
512 512 4 4.3
512 512 6 6.4
512 512 8 6.1
512 512 10 7.2
512 512 15 4.6
512 512 18 3.6
512 512 20 3
512 512 30 2.5
512 512 60 2.3


Another annoying deal was that the results from iperf were not the same each time I ran the test. The results were as much as 3Gbps different from run to run. The results should be the same for each run.





my /etc/system settings settings that I added as suggested by SUN

set ddi_msix_alloc_limit=8
set ip:ip_soft_rings_cnt=8
set ip:ip_squeue_fanout=1
set ip:tcp_squeue_wput=1
set ip:ip_squeue_bind=0
set ipge:ipge_tx_syncq=1
set ipge:ipge_bcopy_thresh = 512
set ipge:ipge_dvma_thresh = 1
set consistent_coloring=2
set pcie:pcie_aer_ce_mask=0x1




Here are the NDD settings that I found here: http://www.sun.com/servers/coolthreads/tnb/parameters.jsp#2

ndd -set /dev/tcp tcp_conn_req_max_q 16384
ndd -set /dev/tcp tcp_conn_req_max_q0 16384
ndd -set /dev/tcp tcp_max_buf 10485760
ndd -set /dev/tcp tcp_cwnd_max 10485760
ndd -set /dev/tcp tcp_xmit_hiwat 131072
ndd -set /dev/tcp tcp_recv_hiwat 131072
ndd -set /dev/nxge0 accept_jumbo 1

I also found information here: http://blogs.sun.com/sunay/entry/the_solaris_networking_the_magic


7500 MB/s! That’s the most impressive numbers I’ve ever seen by FAR. I may have to take back my “10 GbE is a Lie!” blog post, and I’d be happy to do so.

Can you share things besides the T2000? For example,

what OS and patch levels are you running?
Any IP patches?
Any IP-specific patches?
What ndd settings are you using?
Is rss enabled?

“Input, I need Input!”

---
W. Curtis Preston
Backup Blog < at > www.backupcentral.com
VP Data Protection, GlassHouse Technologies


From: veritas-bu-bounces < at > mailman.eng.auburn.edu [mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu] On Behalf Of Peters, Devon C
Sent: Wednesday, October 17, 2007 12:12 PM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE


Since I've seen a little bit of talk about 10GbE on here in the past I figured I'd share some of my experiences...
I've recently been testing some of Sun's dual-port 10GbE NICs on some small T2000's (1Ghz, 4-core). I'm only using a single port on each card, and the servers are currently directly connected to each other (waiting for my network team to get switches and fibre in place).
So far, I've been able to drive throughput between these two systems to about 7500Mbit/sec using iperf. When the throughput gets this high, all the cores/threads on the receiving T2000 become saturated and TCP retransmits start climbing, but both systems remain quite responsive. Since these are only 4-core T2000's, I would guess that the 6 or 8-core T2000's (especially with 1.2Ghz or 1.4Ghz processors) should be capable of more throughput, possibly near line speed.
The down side achieving this high of throughput is that it requires lots of data streams. When transmitting with a single data stream, the most throughput I've gotten is about 1500Mbit/sec. I only got up to 7500Mbit/s when using 64 data streams… Also, the biggest gains seem to be in the jump from 1 to 8 data streams; with 8 streams I was able to get throughput up to 6500Mbit/sec.
Our goal for 10GbE, is to be able to restore data from tape at a speed of at least 2400Mbit/sec (300MB/sec). We have large daily backups (3-4TB) that we would like to be able to restore (not backup) in a reasonable amount of time. These restores are used to refresh our test and development environments with current data. The actual backups are done with array based snapshots (HDS ShadowCopy), which then get mounted and backed up by a dedicated media server (6-core T2000). We're currently getting about 650MB/sec of throughput with the backups (9 streams on 3 LTO3 tape drives - MPX=3 and it's very compressible data).
Going off my iperf results, the restoring this data using 9 streams should get us well over 2400Mbit/sec. But - we haven't installed the cards on our media servers yet, so I have yet to see what the actual performanee of netbackup and LTO3 over 10GbE is. I'm hopeful it'll be close to the iperf results, but if it doesn't meet the goal then we'll be looking at other options.
--
Devon Peters[url][/url]

View user's profile Send private message
Post Some info on my experiences with 10GbE 
Have you tried an aggregate of the 4 onboard 1gb/s NICs?

Paul

--


-----Original Message-----
From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu] On Behalf
Of pancamo
Sent: January 5, 2008 2:43 AM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE



I just started testing 2 T2000's with dual 10Gbps SUN Nics
directly connected to each other...

I'm somewhat pissed that I'm only able to get about 658Mbps
from one thread on the 10Gbps nic while I'm able to get
938Mbps using the onboard 1Gbs nic with when using the iperf
default values. Which means in some cases the 10Gbit nic is
actually slower than the onboard 1Gbit nic
====================================================================================

La version française suit le texte anglais.

------------------------------------------------------------------------------------

This email may contain privileged and/or confidential information, and the Bank of
Canada does not waive any related rights. Any distribution, use, or copying of this
email or the information it contains by other than the intended recipient is
unauthorized. If you received this email in error please delete it immediately from
your system and notify the sender promptly by email that you have done so.

------------------------------------------------------------------------------------

Le présent courriel peut contenir de l'information privilégiée ou confidentielle.
La Banque du Canada ne renonce pas aux droits qui s'y rapportent. Toute diffusion,
utilisation ou copie de ce courriel ou des renseignements qu'il contient par une
personne autre que le ou les destinataires désignés est interdite. Si vous recevez
ce courriel par erreur, veuillez le supprimer immédiatement et envoyer sans délai à
l'expéditeur un message électronique pour l'aviser que vous avez éliminé de votre
ordinateur toute copie du courriel reçu.

_______________________________________________
Veritas-bu maillist - Veritas-bu < at > mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu

Post Re: Some info on my experiences with 10GbE 
Yes we do that today, however aggregates are sessions based for the most part, so you only end up with 1Gbit for the majority of application.

Have you tried an aggregate of the 4 onboard 1gb/s NICs?

Paul

--


-----Original Message-----
From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu] On Behalf
Of pancamo
Sent: January 5, 2008 2:43 AM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE



I just started testing 2 T2000's with dual 10Gbps SUN Nics
directly connected to each other...

I'm somewhat pissed that I'm only able to get about 658Mbps
from one thread on the 10Gbps nic while I'm able to get
938Mbps using the onboard 1Gbs nic with when using the iperf
default values. Which means in some cases the 10Gbit nic is
actually slower than the onboard 1Gbit nic
====================================================================================

La version française suit le texte anglais.

------------------------------------------------------------------------------------

This email may contain privileged and/or confidential information, and the Bank of
Canada does not waive any related rights. Any distribution, use, or copying of this
email or the information it contains by other than the intended recipient is
unauthorized. If you received this email in error please delete it immediately from
your system and notify the sender promptly by email that you have done so.

------------------------------------------------------------------------------------

Le présent courriel peut contenir de l'information privilégiée ou confidentielle.
La Banque du Canada ne renonce pas aux droits qui s'y rapportent. Toute diffusion,
utilisation ou copie de ce courriel ou des renseignements qu'il contient par une
personne autre que le ou les destinataires désignés est interdite. Si vous recevez
ce courriel par erreur, veuillez le supprimer immédiatement et envoyer sans délai à
l'expéditeur un message électronique pour l'aviser que vous avez éliminé de votre
ordinateur toute copie du courriel reçu.

_______________________________________________
Veritas-bu maillist - Veritas-bu < at > mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu

View user's profile Send private message
Post Some info on my experiences with 10GbE 
This is another pro-T2000 report. What makes them special?

---
W. Curtis Preston
Backup Blog < at > www.backupcentral.com
VP Data Protection, GlassHouse Technologies

-----Original Message-----
From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu] On Behalf Of pancamo
Sent: Friday, January 04, 2008 11:43 PM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE


I just started testing 2 T2000's with dual 10Gbps SUN Nics directly
connected to each other...

I'm somewhat pissed that I'm only able to get about 658Mbps from one
thread on the 10Gbps nic while I'm able to get 938Mbps using the onboard
1Gbs nic with when using the iperf default values. Which means in some
cases the 10Gbit nic is actually slower than the onboard 1Gbit nic

However, I was able to get 7.2 Gbps using 10 threads.


Here are some of my max results with different thread (-P) values

./iperf -c 192.168.1.2 -f m -w 512K -l 512 -P x


TCP Win Buffer Threads Gbps
512 512 1 1.4
512 512 2 2.5
512 512 4 4.3
512 512 6 6.4
512 512 8 6.1
512 512 10 7.2
512 512 15 4.6
512 512 18 3.6
512 512 20 3
512 512 30 2.5
512 512 60 2.3


Another annoying deal was that the results from iperf were not the same
each time I ran the test. The results were as much as 3Gbps different
from run to run. The results should be the same for each run.





my /etc/system settings settings that I added as suggested by SUN

set ddi_msix_alloc_limit=8
set ip:ip_soft_rings_cnt=8
set ip:ip_squeue_fanout=1
set ip:tcp_squeue_wput=1
set ip:ip_squeue_bind=0
set ipge:ipge_tx_syncq=1
set ipge:ipge_bcopy_thresh = 512
set ipge:ipge_dvma_thresh = 1
set consistent_coloring=2
set pcie:pcie_aer_ce_mask=0x1




Here are the NDD settings that I found here:
http://www.sun.com/servers/coolthreads/tnb/parameters.jsp#2

ndd -set /dev/tcp tcp_conn_req_max_q 16384
ndd -set /dev/tcp tcp_conn_req_max_q0 16384
ndd -set /dev/tcp tcp_max_buf 10485760
ndd -set /dev/tcp tcp_cwnd_max 10485760
ndd -set /dev/tcp tcp_xmit_hiwat 131072
ndd -set /dev/tcp tcp_recv_hiwat 131072
ndd -set /dev/nxge0 accept_jumbo 1

I also found information here:
http://blogs.sun.com/sunay/entry/the_solaris_networking_the_magic



cpreston wrote:
7500 MB/s! That's the most impressive numbers I've ever seen by FAR. I
may have to take back my "10 GbE is a Lie!" blog post, and I'd be happy
to do so.

Can you share things besides the T2000? For example,

what OS and patch levels are you running?
Any IP patches?
Any IP-specific patches?
What ndd settings are you using?
Is rss enabled?

"Input, I need Input!"

---
W. Curtis Preston
Backup Blog < at > www.backupcentral.com (http://www.backupcentral.com)
VP Data Protection, GlassHouse Technologies


From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu] On Behalf Of Peters,
Devon C
Sent: Wednesday, October 17, 2007 12:12 PM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE


Since I've seen a little bit of talk about 10GbE on here in the past I
figured I'd share some of my experiences...
I've recently been testing some of Sun's dual-port 10GbE NICs on some
small T2000's (1Ghz, 4-core). I'm only using a single port on each card,
and the servers are currently directly connected to each other (waiting
for my network team to get switches and fibre in place).
So far, I've been able to drive throughput between these two systems
to about 7500Mbit/sec using iperf. When the throughput gets this high,
all the cores/threads on the receiving T2000 become saturated and TCP
retransmits start climbing, but both systems remain quite responsive.
Since these are only 4-core T2000's, I would guess that the 6 or 8-core
T2000's (especially with 1.2Ghz or 1.4Ghz processors) should be capable
of more throughput, possibly near line speed.
The down side achieving this high of throughput is that it requires
lots of data streams. When transmitting with a single data stream, the
most throughput I've gotten is about 1500Mbit/sec. I only got up to
7500Mbit/s when using 64 data streams... Also, the biggest gains seem to
be in the jump from 1 to 8 data streams; with 8 streams I was able to
get throughput up to 6500Mbit/sec.
Our goal for 10GbE, is to be able to restore data from tape at a speed
of at least 2400Mbit/sec (300MB/sec). We have large daily backups
(3-4TB) that we would like to be able to restore (not backup) in a
reasonable amount of time. These restores are used to refresh our test
and development environments with current data. The actual backups are
done with array based snapshots (HDS ShadowCopy), which then get mounted
and backed up by a dedicated media server (6-core T2000). We're
currently getting about 650MB/sec of throughput with the backups (9
streams on 3 LTO3 tape drives - MPX=3 and it's very compressible data).
Going off my iperf results, the restoring this data using 9 streams
should get us well over 2400Mbit/sec. But - we haven't installed the
cards on our media servers yet, so I have yet to see what the actual
performanee of netbackup and LTO3 over 10GbE is. I'm hopeful it'll be
close to the iperf results, but if it doesn't meet the goal then we'll
be looking at other options.
--
Devon Peters


+----------------------------------------------------------------------
|This was sent by dan.pancamo < at > aiminvestments.com via Backup Central.
|Forward SPAM to abuse < at > backupcentral.com.
+----------------------------------------------------------------------



_______________________________________________
Veritas-bu maillist - Veritas-bu < at > mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu

View user's profile Send private message
Post Re: Some info on my experiences with 10GbE 
[quote="cpreston"]This is another pro-T2000 report. What makes them special?

The Benchmarks...

8 cores and 4 threads for 32 virtual CPUs and soon up to quad 8 cores and 8 thread for 256 virtual CPUs

More here: http://www.sun.com/servers/index.jsp?cat=CoolThreads%20Servers&tab=3&subcat=UltraSPARC%20T2%20and%20T1

View user's profile Send private message
Post Some info on my experiences with 10GbE 
cpreston < at > glasshouse.com said:
This is another pro-T2000 report. What makes them special?

I'll chime in with my additional comments along the lines of the other
responder, who mentioned the large number of cores and threads, etc.
Some of the kernel tunables mentioned have to do with "interrupt fanout"
support for the Sun 10GbE NIC -- so the NIC's interrupts can be handled
by a number of CPU cores in parallel, which is pretty important when
each core isn't all that speedy (in the T2000, anyway). The T2000's
onboard NIC's also take advantage of interrupt fanout, and it's making
its way into Sun's other hardware platforms (both SPARC and x86) as well.

To go along with the driver support, it's recommended that you use the
Solaris "psradm" and "pooladm" commands to arrange for device interrupts
to be limited to only 1 thread per physical core on the T2000. This way
non-interrupt threads won't compete with the device-based threads. I'm
not an expert with this, but I can report that following the recommendations
does make a marked difference in our overall T2000 system throughput even
with only a single GbE interface. Here are a couple references:

http://www.solarisinternals.com/wiki/index.php/Networks#Tuning_Sun_Dual_10GbE_o
n_T1000.2FT2000
http://serversidetechnologies.blogspot.com/

As a clue to my vintage (Smile, I'll say that the T2000 reminds me of the
Sequent Balance and Symmetry computer systems. These had up to 30 little
CPU's in them (NS32032's, or Intel 80386's), as "fast" as 16 or 20MHz.
The systems were not real speedy on a per-Unix-process basis, but they had
a huge amount of bandwidth -- it took quite an effort to bog one down. My
recollection of those days is a bit rusty, but I remember hearing that much
of Sequent's expertise with parallelizing the Unix kernel ended up getting
folded into Solaris/SVR4 back when AT&T and Sun made their grand SVR4/BSD
reunion deal.

Regards,

Marion


_______________________________________________
Veritas-bu maillist - Veritas-bu < at > mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu

Post Some info on my experiences with 10GbE 
I don't think it's the T2000's that are so special, I think it may have more to do with Sun's cards…
I recently did some 10G testing with a Sun X4100 w/ 2 Opteron 2216's (dual core, 2.4Ghz) running RHEL 4, and found that it drastically outperformed the 4-core 1Ghz T2000's. I just swapped one of the cards from a T2000 into the X4100, and I tested sending data from the X4100->T2000, and then from the T2000->X4100. When sending data from the X4100->T2000, the throughput was the same as T2000->T2000, but when sending data from T2000->X4100, the maximum throughput achieved was 9.8Gb/s with 16 threads. An interesting observation is that all 4 cores on the X4100 were 50% idle when running at this rate.
Also, worth mentioning is that I did some tests between a couple 8-core 1.2Ghz T2000's, and I got them to 9.3Gb/s with 16 threads, so the 8-core 1.2Ghz T2000's definitly outperform the 4-core 1Ghz ones. Big supprise. Smile
Similar to the previous poster, iperf seemed to perform the best with a 512k buffer and 512k tcp window size, and I also saw some large fluctuations in total throughput results. I'm guessing this is due to where the Solaris scheduler is running threads, since with mpstat I'd occasionally see 1-2 cores (8 cpu's in mpstat) at 99-100% sys, and then the other 2 cores would be 100% idle. If the scheduler would spread the load across the cores more evenly then perhaps more throughput could be achieved. To help smooth the results, all the numbers I've reported on the list are the average of 3 separate 5-minute long iperf runs.
Btw, I'm also finding that the single threaded performance is crap with these cards on the 1Ghz T2000's - though with the 1.2Ghz T2000's or X4100, single threaded performance was slightly better (interestingly, the 8-core T2000's consitently stumbled w/ 3 threads):

No. Threads ________________Mbit/sec________________
X4100->T2000 T2000->X4100 T2000->T2000(8core/1.2Ghz)
1 944 2143 1686
2 1867 3988 1937
3 2558 4772 1897
4 3146 5096 3704
6 4368 8071 5934
8 5468 8282 6908
16 6472 9842 9311
32 6513 9893 9283

-devon

------------------------------------
Date: Mon, 7 Jan 2008 15:33:08 -0500
From: "Curtis Preston" <cpreston < at > glasshouse.com>
Subject: Re: [Veritas-bu] Some info on my experiences with 10GbE
To: <VERITAS-BU < at > mailman.eng.auburn.edu>
Message-ID:
<4FBA0941CF3D9347889AA5FF23A809BE012C8BD9 < at > ghmail02.glasshousetech.com>
Content-Type: text/plain; charset="us-ascii"
This is another pro-T2000 report. What makes them special?
---
W. Curtis Preston
Backup Blog < at > www.backupcentral.com
VP Data Protection, GlassHouse Technologies
-----Original Message-----
From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu ([email]veritas-bu-bounces < at > mailman.eng.auburn.edu[/email])] On Behalf Of pancamo
Sent: Friday, January 04, 2008 11:43 PM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE

I just started testing 2 T2000's with dual 10Gbps SUN Nics directly
connected to each other...
I'm somewhat pissed that I'm only able to get about 658Mbps from one
thread on the 10Gbps nic while I'm able to get 938Mbps using the onboard
1Gbs nic with when using the iperf default values. Which means in some
cases the 10Gbit nic is actually slower than the onboard 1Gbit nic
However, I was able to get 7.2 Gbps using 10 threads.

Here are some of my max results with different thread (-P) values
./iperf -c 192.168.1.2 -f m -w 512K -l 512 -P x

TCP Win Buffer Threads Gbps
512 512 1 1.4
512 512 2 2.5
512 512 4 4.3
512 512 6 6.4
512 512 8 6.1
512 512 10 7.2
512 512 15 4.6
512 512 18 3.6
512 512 20 3
512 512 30 2.5
512 512 60 2.3

Another annoying deal was that the results from iperf were not the same
each time I ran the test. The results were as much as 3Gbps different
from run to run. The results should be the same for each run.




my /etc/system settings settings that I added as suggested by SUN
set ddi_msix_alloc_limit=8
set ip:ip_soft_rings_cnt=8
set ip:ip_squeue_fanout=1
set ip:tcp_squeue_wput=1
set ip:ip_squeue_bind=0
set ipge:ipge_tx_syncq=1
set ipge:ipge_bcopy_thresh = 512
set ipge:ipge_dvma_thresh = 1
set consistent_coloring=2
set pcie:pcie_aer_ce_mask=0x1



Here are the NDD settings that I found here:
http://www.sun.com/servers/coolthreads/tnb/parameters.jsp#2
ndd -set /dev/tcp tcp_conn_req_max_q 16384
ndd -set /dev/tcp tcp_conn_req_max_q0 16384
ndd -set /dev/tcp tcp_max_buf 10485760
ndd -set /dev/tcp tcp_cwnd_max 10485760
ndd -set /dev/tcp tcp_xmit_hiwat 131072
ndd -set /dev/tcp tcp_recv_hiwat 131072
ndd -set /dev/nxge0 accept_jumbo 1
I also found information here:
http://blogs.sun.com/sunay/entry/the_solaris_networking_the_magic


cpreston wrote:
7500 MB/s! That's the most impressive numbers I've ever seen by FAR. I
may have to take back my "10 GbE is a Lie!" blog post, and I'd be happy
to do so.

Can you share things besides the T2000? For example,

what OS and patch levels are you running?
Any IP patches?
Any IP-specific patches?
What ndd settings are you using?
Is rss enabled?

"Input, I need Input!"

---
W. Curtis Preston
Backup Blog < at > www.backupcentral.com (http://www.backupcentral.com)
VP Data Protection, GlassHouse Technologies


From: veritas-bu-bounces < at > mailman.eng.auburn.edu
[mailto:veritas-bu-bounces < at > mailman.eng.auburn.edu ([email]veritas-bu-bounces < at > mailman.eng.auburn.edu[/email])] On Behalf Of Peters,
Devon C
Sent: Wednesday, October 17, 2007 12:12 PM
To: VERITAS-BU < at > mailman.eng.auburn.edu
Subject: [Veritas-bu] Some info on my experiences with 10GbE


Since I've seen a little bit of talk about 10GbE on here in the past I
figured I'd share some of my experiences...
I've recently been testing some of Sun's dual-port 10GbE NICs on some
small T2000's (1Ghz, 4-core). I'm only using a single port on each card,
and the servers are currently directly connected to each other (waiting
for my network team to get switches and fibre in place).
So far, I've been able to drive throughput between these two systems
to about 7500Mbit/sec using iperf. When the throughput gets this high,
all the cores/threads on the receiving T2000 become saturated and TCP
retransmits start climbing, but both systems remain quite responsive.
Since these are only 4-core T2000's, I would guess that the 6 or 8-core
T2000's (especially with 1.2Ghz or 1.4Ghz processors) should be capable
of more throughput, possibly near line speed.
The down side achieving this high of throughput is that it requires
lots of data streams. When transmitting with a single data stream, the
most throughput I've gotten is about 1500Mbit/sec. I only got up to
7500Mbit/s when using 64 data streams... Also, the biggest gains seem to
be in the jump from 1 to 8 data streams; with 8 streams I was able to
get throughput up to 6500Mbit/sec.
Our goal for 10GbE, is to be able to restore data from tape at a speed
of at least 2400Mbit/sec (300MB/sec). We have large daily backups
(3-4TB) that we would like to be able to restore (not backup) in a
reasonable amount of time. These restores are used to refresh our test
and development environments with current data. The actual backups are
done with array based snapshots (HDS ShadowCopy), which then get mounted
and backed up by a dedicated media server (6-core T2000). We're
currently getting about 650MB/sec of throughput with the backups (9
streams on 3 LTO3 tape drives - MPX=3 and it's very compressible data).
Going off my iperf results, the restoring this data using 9 streams
should get us well over 2400Mbit/sec. But - we haven't installed the
cards on our media servers yet, so I have yet to see what the actual
performanee of netbackup and LTO3 over 10GbE is. I'm hopeful it'll be
close to the iperf results, but if it doesn't meet the goal then we'll
be looking at other options.
--
Devon Peters

View user's profile Send private message
Display posts from previous:
Reply to topic Page 2 of 2
Goto page Previous  1, 2
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
  


Magic SEO URL for phpBB