CryptoSpiel.com

Together AI Achieves Breakthrough Inference Speed with NVIDIA’s Blackwell GPUs

July 18, 2025
in Blockchain


Lawrence Jengar
Jul 18, 2025 08:45

Together AI unveils the world’s fastest inference for the DeepSeek-R1-0528 model using NVIDIA HGX B200, enhancing AI capabilities for real-world applications.





Together AI has announced what it describes as the fastest inference available for the DeepSeek-R1-0528 model, powered by an inference engine built for the NVIDIA HGX B200 platform. The company says the result positions it as a leading platform for running open-source reasoning models at scale, according to together.ai.

NVIDIA Blackwell Integration

Earlier this year, Together AI invited select customers, including major corporations like Zoom and Salesforce, to test NVIDIA Blackwell GPUs on its GPU Clusters. The results have led to a broader rollout of NVIDIA Blackwell support, unlocking enhanced performance for AI applications. As of July 17, 2025, the company claims to have achieved the fastest serverless inference performance for DeepSeek-R1 using this technology.

Technological Advancements

The new inference stack is optimized at every layer, combining bespoke GPU kernels with a proprietary inference engine. These optimizations aim to boost speed and efficiency without compromising model quality. The stack also includes state-of-the-art speculative decoding methods and advanced model optimization techniques.

Performance Metrics

Together AI’s inference stack achieves up to 334 tokens per second on DeepSeek-R1-0528, a figure the company says exceeds previous benchmarks. The performance draws on NVIDIA’s fifth-generation Tensor Cores and on the ThunderKittens framework, which Together AI uses to develop optimized GPU kernels.
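To put the quoted throughput in practical terms, the short Python sketch below converts tokens per second into per-token latency and total generation time. It assumes a steady decode rate at the quoted peak and ignores time to first token, queueing, and network overhead; the 2,000-token response length and the function name are hypothetical choices made for illustration, not figures or APIs from Together AI.

```python
def generation_time_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a constant decode rate (idealized)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Peak figure quoted in the article; real throughput varies by workload.
TOKENS_PER_SECOND = 334.0

# Hypothetical response length, chosen only for illustration.
RESPONSE_TOKENS = 2000

per_token_ms = 1000.0 / TOKENS_PER_SECOND
total_s = generation_time_seconds(RESPONSE_TOKENS, TOKENS_PER_SECOND)

print(f"{per_token_ms:.2f} ms per token")
print(f"{total_s:.1f} s for a {RESPONSE_TOKENS}-token response")
```

Under these assumptions, each token takes roughly 3 milliseconds, so a response of a few thousand tokens completes in seconds rather than minutes.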

Speculative Decoding and Quantization

Speculative decoding accelerates large language models by using a smaller, faster speculator model to predict several tokens ahead; the larger target model then verifies those predictions and keeps the ones that match its own output. Together AI says its Turbo Speculator outperforms existing speculators by maintaining high target-speculator alignment across various scenarios. Additionally, the company says it has developed a lossless quantization technique that maintains model accuracy while reducing computational overhead.
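Speculative decoding, in which a small model drafts tokens that the larger target model then checks, can be illustrated with a toy Python sketch. This is a generic greedy variant and not Together AI's Turbo Speculator, whose internals the article does not describe. The functions target_next and draft_next are hypothetical stand-ins for real models, and the verification loop here calls the target once per position for clarity, whereas production systems typically verify all drafted positions in a single batched forward pass.

```python
from typing import Callable, List

Token = int
NextTokenFn = Callable[[List[Token]], Token]


def greedy_decode(target_next: NextTokenFn, prompt: List[Token],
                  max_new_tokens: int) -> List[Token]:
    """Reference: plain greedy decoding with the target model only."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(target_next(tokens))
    return tokens


def speculative_decode(target_next: NextTokenFn, draft_next: NextTokenFn,
                       prompt: List[Token], max_new_tokens: int,
                       k: int = 4) -> List[Token]:
    """Greedy speculative decoding; output equals greedy_decode's output."""
    tokens = list(prompt)
    goal = len(prompt) + max_new_tokens
    while len(tokens) < goal:
        # Step 1: the cheap speculator drafts k tokens ahead.
        draft: List[Token] = []
        for _ in range(k):
            draft.append(draft_next(tokens + draft))
        # Step 2: the target checks each drafted position in order.
        for i in range(k):
            expected = target_next(tokens + draft[:i])
            if expected != draft[i]:
                # First mismatch: keep the agreed prefix plus the
                # target's own token, and discard the rest of the draft.
                tokens.extend(draft[:i])
                tokens.append(expected)
                break
        else:
            # Every drafted token matched; the target adds one more.
            tokens.extend(draft)
            tokens.append(target_next(tokens))
    return tokens[:goal]


# Toy stand-in models (assumptions for illustration only).
def target_next(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10  # counts upward, wrapping at 10


def draft_next(ctx: List[Token]) -> Token:
    # Agrees with the target except after a 4, where it guesses wrong.
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


print(speculative_decode(target_next, draft_next, [3], 10, k=4))
print(greedy_decode(target_next, [3], 10))
```

Both calls print the same sequence, which is the point of the technique in its greedy form: the speculator only changes how quickly tokens are produced, not which tokens are produced. The better the speculator agrees with the target, the more drafted tokens are accepted per verification round.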

Real-World Application

The enhancements are designed to support a range of AI workloads, offering flexible infrastructure options for both inference and training. Dedicated Endpoints provide additional optimization, delivering substantial speed improvements while maintaining quality and performance standards.

As the AI landscape continues to evolve, Together AI’s collaboration with NVIDIA and its approach to inference engine development position it as a strong competitor among AI inference providers.

Image source: Shutterstock


