Thursday, April 23, 2026
No Result
View All Result
The Crypto HODL
  • Home
  • Bitcoin
  • Crypto Updates
    • Altcoin
    • Ethereum
    • Crypto Updates
    • Crypto Mining
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Web3
  • Metaverse
  • Regulations
  • Scam Alert
  • Analysis
  • Videos
Marketcap
  • Home
  • Bitcoin
  • Crypto Updates
    • Altcoin
    • Ethereum
    • Crypto Updates
    • Crypto Mining
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Web3
  • Metaverse
  • Regulations
  • Scam Alert
  • Analysis
  • Videos
No Result
View All Result
The Crypto HODL
No Result
View All Result

NVIDIA Megatron Boosts LLM Training With Muon Optimizer

April 23, 2026
in Blockchain
Reading Time: 2 mins read
0 0
A A
0
Home Blockchain
Share on FacebookShare on Twitter


Zach Anderson
Apr 22, 2026 20:41

NVIDIA integrates Muon and superior optimizers into Megatron to boost large-scale LLM coaching with near-parity throughput to AdamW.

NVIDIA is pushing the boundaries of huge language mannequin (LLM) coaching with its integration of superior optimizers like Muon into the Megatron Core framework. In response to NVIDIA’s April 22, 2026 weblog publish, the Muon optimizer, based mostly on higher-order mathematical strategies, has achieved near-parity coaching throughput with the extensively used AdamW optimizer whereas enhancing mannequin efficiency on large-scale techniques just like the NVIDIA GB300 NVL72.

Muon, quick for MomentUm Orthogonalized by Newton-Schulz, is a higher-order optimization algorithm. It has been instrumental in coaching main open-source fashions akin to Kimi K2 and GLM-5. By leveraging superior preconditioning methods, the optimizer ensures larger FLOPs utilization (floating level operations per second), a crucial metric for maximizing computational effectivity in LLMs.

Efficiency Metrics: Muon vs. AdamW

Desk 1 from NVIDIA’s report reveals that Muon delivers comparable throughput to AdamW on the GB300 NVL72 system. As an example, the Kimi K2 mannequin achieved 1,080 TFLOPs/s/GPU with Muon, barely surpassing AdamW’s 1,051 TFLOPs/s/GPU. Equally, the Qwen3 30B mannequin reached 721 TFLOPs/s/GPU with Muon in comparison with 713 TFLOPs/s/GPU with AdamW.

These outcomes have been obtained utilizing the NVIDIA NeMo Megatron Bridge 26.02, a PyTorch-native library designed for pretraining and fine-tuning LLMs. The high-performance benchmarks spotlight Muon’s means to deal with the computational calls for of recent AI workloads with out sacrificing effectivity.

Technological Improvements

Scaling Muon to 1000’s of GPUs presents challenges, together with elevated computational and reminiscence prices throughout preconditioning, in addition to communication bottlenecks in distributed techniques. NVIDIA addresses these hurdles by a number of improvements:

Layer-Clever Distributed Optimizer: Full layers of mannequin parameters are distributed throughout GPUs, enabling environment friendly preconditioning with out extreme communication overhead.
Distributed Newton-Schulz: Two modes—duplicated and distributed—permit versatile dealing with of momentum updates. Whereas the duplicated mode minimizes latency, the distributed mode optimizes computational effectivity.
Communication Hiding and SYRK Fusion: Methods like overlapping parameter updates with computation and fusing SYRK operations with communication considerably scale back latency, boosting general throughput.

Implications and Future Developments

By integrating Muon into the Megatron Core, NVIDIA is equipping researchers and builders with instruments to enhance LLM coaching at scale. The near-parity efficiency with AdamW makes Muon a pretty alternative, particularly as upcoming updates promise additional effectivity positive factors. These embody enhanced load balancing, higher communication methods, and superior kernel optimizations for SYRK operations.

For these desirous to discover these applied sciences, NVIDIA has made instruments and efficiency recipes out there by its Megatron Bridge GitHub repository. With these assets, researchers can implement and benchmark rising optimizers like Muon in their very own LLM tasks.

Picture supply: Shutterstock



Source link

Tags: BoostsLLMMegatronMuonNvidiaOptimizerTraining
Previous Post

How to Use AI as a Cognitive Prosthetic to Enhance Human Creativity

Next Post

Your Business Already Has the Most Valuable AI Asset. You Just Haven’t Extracted It Yet.

Related Posts

AI-Powered Geotechnical Data Platform Transforms NZ Infrastructure
Blockchain

AI-Powered Geotechnical Data Platform Transforms NZ Infrastructure

April 22, 2026
Kalshi Plans Crypto Perpetual Futures to Expand Beyond Prediction Markets
Blockchain

Kalshi Plans Crypto Perpetual Futures to Expand Beyond Prediction Markets

April 22, 2026
Blockchain.com Adds Perps Trading to Self-Custody Wallets
Blockchain

Blockchain.com Adds Perps Trading to Self-Custody Wallets

April 21, 2026
35% of EU Investors May Switch Banks for Crypto Access, Survey Finds
Blockchain

35% of EU Investors May Switch Banks for Crypto Access, Survey Finds

April 21, 2026
Success Story: Douglas Vernon’s Learning Journey with 101 Blockchains
Blockchain

Success Story: Douglas Vernon’s Learning Journey with 101 Blockchains

April 21, 2026
Arbitrum Freezes $71M in ETH Linked to Kelp DAO Hack
Blockchain

Arbitrum Freezes $71M in ETH Linked to Kelp DAO Hack

April 21, 2026
Next Post
Your Business Already Has the Most Valuable AI Asset. You Just Haven’t Extracted It Yet.

Your Business Already Has the Most Valuable AI Asset. You Just Haven't Extracted It Yet.

US Government Runs a Bitcoin Node, But Not Mining BTC: US Admiral

US Government Runs a Bitcoin Node, But Not Mining BTC: US Admiral

Shiba Inu Could Stage A Return As 20% Move Puts It Ahead Of Bitcoin And XRP In This Metric

Shiba Inu Could Stage A Return As 20% Move Puts It Ahead Of Bitcoin And XRP In This Metric

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Twitter Instagram LinkedIn Telegram RSS
The Crypto HODL

Find the latest Bitcoin, Ethereum, blockchain, crypto, Business, Fintech News, interviews, and price analysis at The Crypto HODL

CATEGORIES

  • Altcoin
  • Analysis
  • Bitcoin
  • Blockchain
  • Crypto Exchanges
  • Crypto Mining
  • Crypto Updates
  • DeFi
  • Ethereum
  • Metaverse
  • NFT
  • Regulations
  • Scam Alert
  • Uncategorized
  • Videos
  • Web3

SITE MAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 The Crypto HODL.
The Crypto HODL is not responsible for the content of external sites.

No Result
View All Result
  • Home
  • Bitcoin
  • Crypto Updates
    • Altcoin
    • Ethereum
    • Crypto Updates
    • Crypto Mining
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Web3
  • Metaverse
  • Regulations
  • Scam Alert
  • Analysis
  • Videos
Crypto Marketcap

Copyright © 2023 The Crypto HODL.
The Crypto HODL is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In