CryptoSpiel.com

AMD Introduces AMD-135M: A Breakthrough in Small Language Models

September 28, 2024
in Blockchain


Luisa Crawford
Sep 28, 2024 07:13

AMD has unveiled its first small language model, AMD-135M, with Speculative Decoding, enhancing AI model efficiency and performance.





In a significant development within the artificial intelligence sector, AMD has announced the release of its first small language model (SLM), AMD-135M. This new model aims to offer specialized capabilities while addressing some of the limitations faced by large language models (LLMs) such as GPT-4 and Llama, according to AMD.com.

AMD-135M: First AMD Small Language Model

AMD-135M, part of the Llama family, is AMD’s first effort in the SLM arena. The model was trained from scratch on 670 billion tokens using AMD Instinct™ MI250 accelerators. Training produced two distinct models: AMD-Llama-135M and AMD-Llama-135M-code. The former was pretrained on general data, while the latter was further fine-tuned on an additional 20 billion tokens of code data.

Pretraining: AMD-Llama-135M was trained over six days using four MI250 nodes. The code-focused variant, AMD-Llama-135M-code, required an additional four days for fine-tuning.

All associated training code, datasets, and model weights are open-sourced, enabling developers to reproduce the model and contribute to the training of other SLMs and LLMs.

Optimization with Speculative Decoding

One of the notable advancements in AMD-135M is the use of speculative decoding. Traditional autoregressive decoding in large language models often suffers from low memory access efficiency, because each forward pass generates only a single token. Speculative decoding addresses this by using a small draft model to generate a run of candidate tokens, which a larger target model then verifies. Each forward pass of the target model can therefore yield multiple tokens, improving memory access efficiency and inference speed.
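To make the draft-and-verify loop concrete, here is a minimal Python sketch of the greedy variant of speculative decoding. It is an illustration under stated assumptions, not AMD's implementation: `target_next` and `draft_next` are hypothetical toy functions standing in for the large and small models, and the target's verification of all drafted positions is simulated with a list comprehension, where a real system would use one batched forward pass.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]  # greedy next-token function


def target_next(prefix: List[Token]) -> Token:
    """Hypothetical 'large' model: a deterministic toy function of the prefix."""
    return (sum(prefix) * 31 + len(prefix)) % 50


def draft_next(prefix: List[Token]) -> Token:
    """Hypothetical 'small' model: agrees with the target except when the
    prefix length is a multiple of 4, where it deliberately guesses wrong."""
    guess = target_next(prefix)
    return (guess + 1) % 50 if len(prefix) % 4 == 0 else guess


def greedy_decode(model: NextToken, prompt: List[Token], n_new: int) -> List[Token]:
    """Baseline: one model call (one forward pass) per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(model(out))
    return out


def speculative_decode(
    target: NextToken, draft: NextToken, prompt: List[Token], n_new: int, k: int = 4
) -> Tuple[List[Token], int]:
    """Greedy speculative decoding; returns (tokens, number of target passes)."""
    out = list(prompt)
    goal = len(prompt) + n_new
    target_passes = 0
    while len(out) < goal:
        # 1. The draft model proposes k candidate tokens autoregressively.
        proposal: List[Token] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The target checks all k + 1 positions; counted as ONE pass because
        #    a real model would score them together in a single batch.
        target_passes += 1
        verified = [target(out + proposal[:i]) for i in range(k + 1)]
        # 3. Accept the longest run of drafted tokens the target agrees with.
        n_ok = 0
        while n_ok < k and proposal[n_ok] == verified[n_ok]:
            n_ok += 1
        # 4. Keep the accepted tokens plus the target's own token at the first
        #    disagreement (or a bonus token if every draft was accepted).
        out.extend(proposal[:n_ok])
        out.append(verified[n_ok])
    return out[:goal], target_passes


if __name__ == "__main__":
    prompt = [1, 2, 3]
    tokens, passes = speculative_decode(target_next, draft_next, prompt, n_new=20)
    # Same output as plain greedy decoding, but with fewer target passes.
    print(tokens == greedy_decode(target_next, prompt, 20), passes)
```

Because every kept token is either confirmed or produced by the target, the output matches what the target would generate on its own with greedy decoding. The gain comes only from needing fewer target passes, and it shrinks as the draft model's guesses diverge from the target's.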

Inference Performance Acceleration

AMD has tested the performance of AMD-Llama-135M-code as a draft model for CodeLlama-7b on various hardware configurations, including the MI250 accelerator and the Ryzen™ AI processor. The results indicated a considerable speedup in inference performance when speculative decoding was employed. This enhancement establishes an end-to-end workflow for training and inferencing on selected AMD platforms.

Next Steps

By providing an open-source reference implementation, AMD aims to foster innovation within the AI community. The company encourages developers to explore and contribute to this new frontier in AI technology.

For more details on AMD-135M, visit the full technical blog on AMD.com.

Image source: Shutterstock

