CryptoSpiel.com

NVIDIA Unveils DoRA: A Superior Fine-Tuning Method for AI Models

June 29, 2024
in Blockchain

NVIDIA has announced DoRA (Weight-Decomposed Low-Rank Adaptation), a fine-tuning method positioned as a higher-performing alternative to the widely used Low-Rank Adaptation (LoRA). According to the NVIDIA Technical Blog, DoRA improves both the learning capacity and the stability of LoRA without adding any inference overhead.

Advantages of DoRA

According to NVIDIA, DoRA improves on LoRA across a range of large language models (LLMs) and vision language models (VLMs). On common-sense reasoning tasks, for instance, it outperformed LoRA by 3.7 points on Llama 7B and by 4.4 points on Llama 3 8B. It also scored higher on multi-turn benchmarks, image/video-text understanding, and visual instruction tuning tasks.

The method has been accepted as an oral paper at ICML 2024.

Mechanics of DoRA

DoRA decomposes each pretrained weight into a magnitude component and a directional component and fine-tunes both. The directional component is updated with LoRA, which keeps the fine-tuning efficient. After training, DoRA merges the fine-tuned components back into the pretrained weight, so inference incurs no additional latency.
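The decomposition and merge can be illustrated with a minimal sketch in plain Python. It assumes the formulation from the DoRA paper as we understand it: the merged weight is m * (W0 + B A) / ||W0 + B A||, where the norm is taken per column, m is a trainable magnitude vector initialised to the column norms of W0, and B A is the LoRA update with B initialised to zero. The function names, matrix sizes, and the random "training step" are hypothetical and only serve the illustration.

```python
import math
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def column_norms(W):
    """L2 norm of each column of W."""
    return [math.sqrt(sum(v * v for v in col)) for col in zip(*W)]

def dora_merge(W0, m, B, A):
    """Return m * (W0 + B A) / ||W0 + B A||, with the norm taken per column."""
    delta = matmul(B, A)  # low-rank directional update (LoRA)
    V = [[w + d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(W0, delta)]
    norms = column_norms(V)
    # Rescale every column to unit length, then apply its learned magnitude.
    return [[m_j * v / n_j for v, m_j, n_j in zip(row, m, norms)] for row in V]

random.seed(0)
d, k, r = 4, 3, 2  # output dim, input dim, LoRA rank (illustrative sizes)
W0 = [[random.gauss(0, 1) for _ in range(k)] for _ in range(d)]

m = column_norms(W0)                # magnitude starts at the pretrained column norms
B = [[0.0] * r for _ in range(d)]   # zero init, so the update B A starts at zero
A = [[random.gauss(0, 1) for _ in range(k)] for _ in range(r)]

# Before any training, the merged weight reproduces the pretrained weight.
W_init = dora_merge(W0, m, B, A)

# Stand-in for training: perturb the direction (via B) and the magnitude (via m).
B = [[0.1 * random.gauss(0, 1) for _ in range(r)] for _ in range(d)]
m = [1.1 * v for v in m]

# A single dense matrix with the same shape as W0, usable at inference as-is.
W_merged = dora_merge(W0, m, B, A)
```

Because `W_merged` has the same shape as `W0`, it can replace the pretrained weight directly, which is why the merged model runs at the same cost as the original.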

Visualizations of the magnitude and directional differences between DoRA and pretrained weights reveal that DoRA makes substantial directional adjustments with minimal changes in magnitude, closely resembling full fine-tuning (FT) learning patterns.
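To make "magnitude difference" and "directional difference" concrete, here is a small sketch in plain Python. It assumes the per-column definitions we believe the DoRA paper uses: the magnitude difference is the mean absolute change in column norm, and the directional difference is the mean of one minus the cosine similarity between corresponding columns. The helper names and the toy matrices are hypothetical.

```python
import math

def columns(W):
    """Return the columns of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*W)]

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def magnitude_difference(W_tuned, W0):
    """Mean absolute change in column norm between tuned and pretrained weights."""
    pairs = list(zip(columns(W_tuned), columns(W0)))
    return sum(abs(norm(a) - norm(b)) for a, b in pairs) / len(pairs)

def direction_difference(W_tuned, W0):
    """Mean of (1 - cosine similarity) between corresponding columns."""
    pairs = list(zip(columns(W_tuned), columns(W0)))
    total = 0.0
    for a, b in pairs:
        cosine = sum(x * y for x, y in zip(a, b)) / (norm(a) * norm(b))
        total += 1.0 - cosine
    return total / len(pairs)

W0 = [[1.0, 0.0],
      [0.0, 2.0]]

# Each column rotated by 90 degrees, lengths unchanged: pure directional change.
W_rotated = [[0.0, -2.0],
             [1.0, 0.0]]

# Each column doubled in length, directions unchanged: pure magnitude change.
W_scaled = [[2.0, 0.0],
            [0.0, 4.0]]

rotated_stats = (magnitude_difference(W_rotated, W0), direction_difference(W_rotated, W0))
scaled_stats = (magnitude_difference(W_scaled, W0), direction_difference(W_scaled, W0))
```

In this toy case the rotated matrix changes only in direction and the scaled matrix changes only in magnitude, which is the kind of separation the visualizations rely on.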

Performance Across Models

Across the reported benchmarks, DoRA consistently outperforms LoRA. In large language models, it improves commonsense reasoning and conversation/instruction-following capabilities. In vision language models, it gives better results on image-text and video-text understanding, as well as on visual instruction tuning tasks.

Large Language Models

In NVIDIA's comparisons, DoRA surpasses LoRA on commonsense reasoning benchmarks and multi-turn benchmarks, with higher average scores across the evaluated datasets.

Vision Language Models

DoRA also outperforms LoRA in vision language models on image-text understanding, video-text understanding, and visual instruction tuning, again with higher average scores across multiple benchmarks.

Compression-Aware LLMs

DoRA can be integrated into the QLoRA framework to improve the accuracy of low-bit pretrained models. Work with Answer.AI on the QDoRA project showed that QDoRA outperforms both full fine-tuning (FT) and QLoRA on Llama 2 and Llama 3 models.
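The idea of pairing a low-bit frozen base with DoRA can be sketched as follows in plain Python. This is an illustration under loud assumptions, not the QDoRA implementation: QLoRA uses block-wise 4-bit NormalFloat (NF4) quantization, whereas this sketch substitutes simple symmetric uniform 4-bit quantization with a single scale, and every function name and value below is hypothetical. The frozen base is stored as integer codes, dequantized on the fly, and then combined with a full-precision magnitude vector and low-rank update.

```python
import math

def quantize_4bit(W):
    """Symmetric uniform quantisation to integer codes in [-7, 7] (stand-in for NF4)."""
    scale = max(abs(v) for row in W for v in row) / 7.0
    codes = [[int(round(v / scale)) for v in row] for row in W]
    return codes, scale

def dequantize(codes, scale):
    return [[c * scale for c in row] for row in codes]

def column_norms(W):
    return [math.sqrt(sum(v * v for v in col)) for col in zip(*W)]

def qdora_weight(codes, scale, m, delta):
    """Effective weight: m * (dequant(W0) + delta) / column norm of that sum."""
    W_base = dequantize(codes, scale)  # frozen low-bit base, restored to floats
    V = [[w + d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(W_base, delta)]
    norms = column_norms(V)
    return [[m_j * v / n_j for v, m_j, n_j in zip(row, m, norms)] for row in V]

W0 = [[0.52, -1.18, 0.33],
      [-0.71, 0.44, 0.93],
      [1.40, 0.12, -0.65]]

codes, scale = quantize_4bit(W0)   # only the codes and the scale need storing
W_deq = dequantize(codes, scale)

# Full-precision trainable parts: the magnitude vector and a rank-1 update b a^T.
m = column_norms(W_deq)
b = [0.10, -0.20, 0.05]
a = [0.30, 0.10, -0.40]
delta = [[b_i * a_j for a_j in a] for b_i in b]

W_eff = qdora_weight(codes, scale, m, delta)
```

Only `m` and the factors of `delta` would be trained in such a setup, while the quantized codes stay fixed.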

Text-to-Image Generation

DoRA also applies to text-to-image personalization with DreamBooth, where it yields significantly better results than LoRA on challenging datasets such as 3D Icon and Lego sets.

Implications and Future Applications

DoRA is compatible with LoRA and its variants and is positioned to become a default choice for fine-tuning AI models. Its efficiency and effectiveness make it useful for adapting foundation models to a range of applications, including NVIDIA Metropolis, NVIDIA NeMo, NVIDIA NIM, and NVIDIA TensorRT.

For more detailed information, visit the NVIDIA Technical Blog.
