<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Communication on Echo's Tech Blog</title><link>https://cybersecurityerial.github.io/echo_blog/tags/communication/</link><description>Recent content in Communication on Echo's Tech Blog</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Wed, 17 Jun 2026 00:00:00 +0800</lastBuildDate><atom:link href="https://cybersecurityerial.github.io/echo_blog/tags/communication/index.xml" rel="self" type="application/rss+xml"/><item><title>LLM System: Training Framework Notes 04 - Megatron Communicator Design: How to Prevent Deadlock</title><link>https://cybersecurityerial.github.io/echo_blog/posts/llm-system-training-framework-notes-04-megatron-communicator-deadlock/</link><pubDate>Wed, 17 Jun 2026 00:00:00 +0800</pubDate><guid>https://cybersecurityerial.github.io/echo_blog/posts/llm-system-training-framework-notes-04-megatron-communicator-deadlock/</guid><description>&lt;blockquote&gt;
&lt;p&gt;Goals of this post:&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id="为什么通信器会死锁"&gt;Why communicators deadlock&lt;/h2&gt;
&lt;h2 id="megatron-的-p2p-通信抽象"&gt;Megatron's P2P communication abstraction&lt;/h2&gt;
&lt;h2 id="batch-p2p-comm"&gt;batch p2p comm&lt;/h2&gt;
&lt;h2 id="overlap-p2p-comm"&gt;overlap p2p comm&lt;/h2&gt;
&lt;h2 id="warmup--steady--cooldown-里的通信顺序"&gt;Communication order in warmup / steady / cooldown&lt;/h2&gt;
&lt;h2 id="如何设计防死锁的通信接口"&gt;How to design a deadlock-free communication interface&lt;/h2&gt;
&lt;h2 id="todo"&gt;TODO&lt;/h2&gt;</description></item></channel></rss>