<?xml version="1.0" encoding="utf-8"?><?xml-stylesheet type="text/xsl" href="atom.xsl"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <id>https://losslessnetwork.com/blog</id>
    <title>Lossless Network Blog</title>
    <updated>2026-04-28T00:00:00.000Z</updated>
    <generator>https://github.com/jpmonette/feed</generator>
    <link rel="alternate" href="https://losslessnetwork.com/blog"/>
    <subtitle>Lossless Network Blog</subtitle>
    <icon>https://losslessnetwork.com/img/favicon.svg</icon>
    <entry>
        <title type="html"><![CDATA[Hello, Lossless Network]]></title>
        <id>https://losslessnetwork.com/blog/hello-losslessnetwork</id>
        <link href="https://losslessnetwork.com/blog/hello-losslessnetwork"/>
        <updated>2026-04-28T00:00:00.000Z</updated>
        <summary type="html"><![CDATA[TLDR: New site, built for network engineers entering AI. Deep modules + fast blog + zero vendor noise. First module drops soon.]]></summary>
        <content type="html"><![CDATA[<p><strong>TLDR:</strong> New site, built for network engineers entering AI. Deep modules + fast blog + zero vendor noise. First module drops soon.</p>
<hr>
<p>Welcome to <strong>Lossless Network</strong> — AI networking, distilled for network engineers.</p>
<p>If you've ever tried to design or operate a network fabric for large-scale AI training, you've felt the gap. The standards are public. The vendor whitepapers exist. But nowhere is there a single, opinionated, technically honest walkthrough of how the pieces fit together — written by a network engineer, for network engineers — and why some choices that look right on paper fall apart at scale.</p>
<!-- -->
<p>This site is my attempt to fix that.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="who-this-is-for">Who this is for<a href="https://losslessnetwork.com/blog/hello-losslessnetwork#who-this-is-for" class="hash-link" aria-label="Direct link to Who this is for" title="Direct link to Who this is for" translate="no">​</a></h2>
<p>You're a network engineer who:</p>
<ul>
<li class="">Builds and operates production data center fabrics</li>
<li class="">Has been told to "support AI workloads" — which now means RoCEv2, lossless Ethernet, NCCL collectives, and topologies that look nothing like CLOS</li>
<li class="">Wants to actually <em>understand</em> what's happening on the wire when 1,024 GPUs run all-reduce simultaneously</li>
<li class="">Refuses to pretend a vendor slide deck is a design document</li>
</ul>
<p>If that's you, you're home.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="the-promise">The promise<a href="https://losslessnetwork.com/blog/hello-losslessnetwork#the-promise" class="hash-link" aria-label="Direct link to The promise" title="Direct link to The promise" translate="no">​</a></h2>
<p><strong>Deep when you need depth. Fast when you need speed.</strong></p>
<ul>
<li class="">Got 30 seconds? Read the <strong>TLDR</strong> at the top of every post. That's all you need.</li>
<li class="">Got 5 minutes? Read the blog. Sharp takes, no filler.</li>
<li class="">Got an afternoon? Take a module. First principles to production deployment, end to end.</li>
</ul>
<p>You decide how deep to go. The content respects your time.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="whats-coming">What's coming<a href="https://losslessnetwork.com/blog/hello-losslessnetwork#whats-coming" class="hash-link" aria-label="Direct link to What's coming" title="Direct link to What's coming" translate="no">​</a></h2>
<p>Six modules, written in order:</p>
<ol>
<li class=""><strong>RDMA Fundamentals</strong> — verbs, QPs, MRs, and why kernel-bypass exists</li>
<li class=""><strong>RoCEv2 &amp; Lossless Ethernet</strong> — PFC, ECN, DCQCN, and what makes RDMA work over Ethernet</li>
<li class=""><strong>AI Fabric Architecture</strong> — rail-optimized topologies, NCCL, the all-reduce bottleneck</li>
<li class=""><strong>Congestion Control</strong> — the actual tuning, with numbers</li>
<li class=""><strong>Adaptive Routing</strong> — DLB, FLB, and why static ECMP kills GPU jobs</li>
<li class=""><strong>UEC &amp; The Future</strong> — what comes after RoCE</li>
</ol>
<p>Plus a blog for everything that doesn't fit the module structure — field notes, debugging stories, paper reviews, and "the thing the vendor didn't tell you" posts.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="what-this-site-is-not">What this site is not<a href="https://losslessnetwork.com/blog/hello-losslessnetwork#what-this-site-is-not" class="hash-link" aria-label="Direct link to What this site is not" title="Direct link to What this site is not" translate="no">​</a></h2>
<ul>
<li class="">A vendor pitch</li>
<li class="">A reskinned wiki dump</li>
<li class="">AI-generated filler</li>
</ul>
<p>Every word is written by hand, reviewed by hand, and grounded in real engineering experience.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="stay-updated">Stay updated<a href="https://losslessnetwork.com/blog/hello-losslessnetwork#stay-updated" class="hash-link" aria-label="Direct link to Stay updated" title="Direct link to Stay updated" translate="no">​</a></h2>
<p>The blog has an RSS feed. New modules drop one at a time. Follow along, push back when I'm wrong, and let's build the resource the AI networking field has been missing.</p>
<p>— Nagarjun</p>]]></content>
        <author>
            <name>Nagarjun Velmurugan</name>
            <uri>/about</uri>
        </author>
        <category label="Meta" term="Meta"/>
    </entry>
</feed>