{{Short description|Network packet distribution with multiple cores}}
[[Network packet]] steering of transmitted and received traffic across [[Multi-core_processor|multi-core architectures]] is needed in modern network computing environments, especially in [[Data_center|data centers]], where high bandwidth and heavy loads can easily congest a single core's [[Queueing theory|queue]].<ref name="RSS++">{{Cite journal |last=Barbette |first=Tom |last2=Katsikas |first2=Georgios P. |last3=Maguire |first3=Gerald Q. |last4=Kostić |first4=Dejan |date=2019-12-03 |title=RSS++: load and state-aware receive side scaling |url=https://dl.acm.org/doi/10.1145/3359989.3365412 |journal=Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies |series=CoNEXT '19 |___location=New York, NY, USA |publisher=Association for Computing Machinery |doi=10.1145/3359989.3365412 |isbn=978-1-4503-6998-5}}</ref>
[[File:Simple NIC and cores architecture.png|thumb|upright=1.7|Simple graph showing the path receiving packets need to travel to reach the cores' queues]]
For this reason, many techniques, both in hardware and in software, are used to distribute the incoming packet load across the cores of the [[Central processing unit|processor]].
As shown in the figure, packets arriving at the [[Network_interface_controller|network interface card (NIC)]] are processed and placed into the receive queues managed by the cores (usually implemented as [[Circular buffer|ring buffers]] in [[User space and kernel space|kernel space]]).
The main objective is to leverage all the cores available in the [[Central processing unit|CPU]] to process incoming packets, while also improving metrics such as [[Latency (engineering)|latency]] and [[Network throughput|throughput]].<ref name="RSS kernel linux docs">{{Cite web|title=RSS kernel linux docs|url=https://www.kernel.org/doc/html/v5.1/networking/scaling.html#rss-receive-side-scaling|access-date=2025-07-08|website=kernel.org|publisher=The Linux Kernel documentation|language=en-US}}</ref><ref name="RSS overview by microsoft">{{Cite web|title=RSS overview by microsoft|url=https://learn.microsoft.com/en-us/windows-hardware/drivers/network/introduction-to-receive-side-scaling|access-date=2025-07-08|website=learn.microsoft.com|language=en-US}}</ref>
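The queue-selection step common to these techniques can be sketched as follows. This is a simplified, illustrative model only: real NICs typically compute a Toeplitz hash over the flow tuple and look the result up in an indirection table, whereas here a generic checksum stands in, and the function name and addresses are hypothetical.

```python
import zlib

def select_queue(src_ip: str, dst_ip: str, src_port: int, dst_port: int,
                 num_queues: int) -> int:
    """Map a flow 4-tuple to one of num_queues per-core receive queues.

    Simplified stand-in for hash-based steering: real hardware uses a
    Toeplitz hash and an indirection table rather than CRC32 modulo.
    """
    key = f"{src_ip}:{src_port}-{dst_ip}:{dst_port}".encode()
    return zlib.crc32(key) % num_queues

# Packets of the same flow always hash to the same queue (and thus the
# same core), which preserves per-flow packet ordering.
q1 = select_queue("10.0.0.1", "10.0.0.2", 40000, 80, 8)
q2 = select_queue("10.0.0.1", "10.0.0.2", 40000, 80, 8)
assert q1 == q2 and 0 <= q1 < 8
```

The key property is that steering is deterministic per flow, so distribution across cores happens at flow granularity, not packet granularity.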
== Hardware techniques ==
Hardware-accelerated techniques such as RSS and aRFS are used to route and load-balance incoming [[Network_packet|packets]] across the multiple per-core queues of a processor.<ref name="RSS++" /><br>
These hardware-supported methods achieve very low latency and reduce the load on the CPU compared to software-based ones. However, they require specialized hardware integrated into the [[Network_interface_controller|network interface controller]] (usually available on more advanced cards, such as the [[Data_processing_unit|SmartNIC]]).<ref name="aRFS by redhat">{{Cite web|title=aRFS by redhat|url=https://docs.redhat.com/en/documentation/red_hat_enterprise_linux/6/html/performance_tuning_guide/network-acc-rfs|access-date=2025-07-08|website=docs.redhat.com|publisher=Red Hat Documentation|language=en-US}}</ref><ref name="aRFS by nvidea">{{Cite web|title=aRFS by nvidea|url=https://docs.nvidia.com/networking/display/mlnxofedv23070512/flow+steering#src-2396583156_safe-id-Rmxvd1N0ZWVyaW5nLUFjY2VsZXJhdGVkUmVjZWl2ZUZsb3dTdGVlcmluZyhhUkZTKQ|access-date=2025-07-08|website=docs.nvidia.com|publisher=NVIDIA Documentation Hub|language=en-US}}</ref>
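On Linux, these hardware features are typically inspected and configured with <code>ethtool</code>. A minimal sketch, assuming an interface named <code>eth0</code> whose NIC supports RSS and aRFS (the interface name and queue count are illustrative):

```shell
# Show how many hardware receive queues (channels) the NIC exposes.
ethtool -l eth0

# Program the RSS indirection table to spread flows evenly over 8 queues.
ethtool -X eth0 equal 8

# aRFS additionally requires ntuple flow filtering to be enabled.
ethtool -K eth0 ntuple on
```

Whether these commands succeed depends on driver and firmware support; NICs without the corresponding hardware simply reject them.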
=== RSS ===
== Software techniques ==
Software techniques like RPS and RFS employ one of the CPU cores to steer incoming packets across the other cores of the processor. This comes at the cost of additional [[Inter-processor interrupt|inter-processor interrupts (IPIs)]]; however, the number of hardware interrupts does not increase and, by employing an [[Interrupt coalescing|interrupt aggregation]] technique, it may even be reduced.<ref name="RPS kernel linux docs">{{Cite web|title=RPS kernel linux docs|url=https://www.kernel.org/doc/html/v5.1/networking/scaling.html#rps-receive-packet-steering|access-date=2025-07-08|website=kernel.org|publisher=The Linux Kernel documentation|language=en-US}}</ref><br>
The benefit of a software solution is its ease of implementation: no component of the existing architecture (such as the [[Network_interface_controller|NIC]]) needs to be changed; deploying the proper [[Loadable kernel module|kernel module]] is enough. This can be crucial where the server machine cannot be customized or accessed (as in [[Cloud computing#Infrastructure as a service (IaaS)|cloud computing]] environments), although network performance may be lower than with the hardware-supported techniques.<ref name="RPS linux news (LWM)">{{Cite web|last1=Corbet |first1=Jonathan |title=RPS linux news (LWM)|url=https://lwn.net/Articles/362339/|access-date=2025-07-08|website=lwn.net|date=17 November 2009 |publisher=Linux Weekly News|language=en-US}}</ref><ref name="RPS by redhat">{{Cite web|title=RPS by redhat|url=https://docs.redhat.com/en/documentation/red_hat_enterprise_linux/6/html/performance_tuning_guide/network-rps|access-date=2025-07-08|website=docs.redhat.com|publisher=Red Hat Documentation|language=en-US}}</ref><ref name="RFS by nvidea" />
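On Linux, RPS and RFS are enabled per receive queue through sysfs and procfs, as described in the kernel scaling documentation cited above. A sketch, assuming an interface <code>eth0</code> with a single receive queue (the interface name, CPU mask, and table sizes are illustrative, not recommendations):

```shell
# RPS: allow cores 0-3 (bitmask 0xf) to process packets from rx queue 0.
echo f > /sys/class/net/eth0/queues/rx-0/rps_cpus

# RFS: size the global socket-flow table used to track where each
# flow's consuming application last ran...
echo 32768 > /proc/sys/net/core/rps_sock_flow_entries

# ...and give the receive queue its per-queue share of flow entries.
echo 2048 > /sys/class/net/eth0/queues/rx-0/rps_flow_cnt
```

With RPS alone, packets are spread by flow hash over the masked cores; RFS refines this by steering each flow toward the core where its application is running.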
=== RPS ===