• Increase the allocation size of Driver Resources (transmit/receive buffers). However, most TCP traffic
patterns work best with the transmit buffer set to its default value, and the receive buffer set to its
minimum value.
• When passing traffic on multiple network ports using an I/O application that runs on most or all of the
cores in your system, consider setting the CPU Affinity for that application to fewer cores. This should
reduce CPU utilization and in some cases may increase throughput for the device. The cores selected
for CPU Affinity must be local to the affected network device's Processor Node/Group. You can use
the PowerShell command Get-NetAdapterRSS to list the cores that are local to a device. You may
need to increase the number of cores assigned to the application to maximize throughput. Refer to your
operating system documentation for more details on setting the CPU Affinity.
• If you have multiple 10 Gbps (or faster) ports installed in a system, the RSS queues of each adapter
port can be adjusted to use non-overlapping sets of processors within the adapter's local NUMA
Node/Socket. Change the RSS Base Processor Number for each adapter port so that the combination
of the base processor and the max number of RSS processors settings ensures non-overlapping cores.
1. Identify the adapter ports to be adjusted and inspect their RssProcessorArray using the
Get-NetAdapterRSS PowerShell cmdlet.
2. Identify the processors with NUMA distance 0. These are the cores in the adapter's local
NUMA Node/Socket and will provide the best performance.
3. Adjust the RSS Base processor on each port to use a non-overlapping set of processors within
the local set of processors. You can do this manually or using the following PowerShell command:
Set-NetAdapterAdvancedProperty -Name <Adapter Name> -DisplayName "RSS Base Processor Number" -DisplayValue <RSS Base Proc Value>
4. Use the Get-NetAdapterAdvancedProperty cmdlet to check that the right values have been set:
Get-NetAdapterAdvancedProperty -Name <Adapter Name>
For example, for a 4-port adapter with local processors 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22,
24, 26, 28, 30, and a 'Max RSS processor' setting of 8, set the RSS base processors to 0, 8, 16 and 24.
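The steps above can be sketched as a short PowerShell loop that applies the base processor values from the 4-port example. This is only an illustration: the adapter names 'Ethernet 1' through 'Ethernet 4' are hypothetical placeholders, and the sketch assumes the driver exposes the "RSS Base Processor Number" advanced property as described in step 3. Changing adapter settings typically requires an elevated PowerShell session.

```powershell
# Hypothetical adapter names; substitute the names reported by Get-NetAdapter.
# The base processor values come from the 4-port example above.
$rssBase = [ordered]@{
    'Ethernet 1' = '0'
    'Ethernet 2' = '8'
    'Ethernet 3' = '16'
    'Ethernet 4' = '24'
}

foreach ($name in $rssBase.Keys) {
    # Steps 1 and 2: show the current RSS settings, including RssProcessorArray
    # and the NUMA distance of each processor.
    Get-NetAdapterRss -Name $name

    # Step 3: set the RSS base processor for this port.
    Set-NetAdapterAdvancedProperty -Name $name `
        -DisplayName 'RSS Base Processor Number' `
        -DisplayValue $rssBase[$name]

    # Step 4: confirm that the new value was applied.
    Get-NetAdapterAdvancedProperty -Name $name `
        -DisplayName 'RSS Base Processor Number'
}
```

Review the Get-NetAdapterRSS output for each port before applying the change, so that every base processor you choose is one of the processors local to that adapter.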
Optimized for quick response and low latency
• Minimize or disable Interrupt Moderation Rate.
• Disable Offload TCP Segmentation.
• Disable Jumbo Packets.
• Increase Transmit Descriptors.
• Increase Receive Descriptors.
• Increase RSS Queues.
Optimized for throughput
• Enable Jumbo Packets.
• Increase Transmit Descriptors.
• Increase Receive Descriptors.
• On systems that support NUMA, set the Preferred NUMA Node on each adapter to achieve better
scaling across NUMA nodes.