CryptoSpiel.com
No Result
View All Result
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams
No Result
View All Result
CryptoSpiel.com
No Result
View All Result

Together AI Unveils Inference Engine 2.0 with Turbo and Lite Endpoints

July 18, 2024
in Blockchain
Reading Time: 2 mins read
A A
0
Bitcoin Holdings in Public Company Treasuries Exceed 200,000 BTC
0
SHARES
4
VIEWS
ShareShareShareShareShare


Terrill Dicki
Jul 18, 2024 18:41

Together AI launches Inference Engine 2.0, offering Turbo and Lite endpoints for enhanced performance, quality, and cost-efficiency.





Together AI has announced the release of its new Inference Engine 2.0, which includes the highly anticipated Turbo and Lite endpoints. This new inference stack is designed to provide significantly faster decoding throughput and superior performance compared to existing solutions.

Performance Enhancements

According to together.ai, the Together Inference Engine 2.0 offers decoding throughput that is four times faster than the open-source vLLM and outperforms commercial solutions such as Amazon Bedrock, Azure AI, Fireworks, and Octo AI by 1.3x to 2.5x. The engine achieves over 400 tokens per second on Meta Llama 3 8B, thanks to advancements in FlashAttention-3, faster GEMM & MHA kernels, quality-preserving quantization, and speculative decoding.

New Turbo and Lite Endpoints

Together AI has introduced new Turbo and Lite endpoints, starting with Meta Llama 3. These endpoints aim to balance performance, quality, and cost, allowing enterprises to avoid compromises. Together Turbo closely matches the quality of full-precision FP16 models, while Together Lite offers the most cost-efficient and scalable Llama 3 models available.

Together Turbo endpoints provide fast FP8 performance while maintaining quality, matching FP16 reference models and outperforming other FP8 solutions on AlpacaEval 2.0. These Turbo endpoints are priced at $0.88 per million tokens for 70B and $0.18 for 8B, making them significantly more affordable than GPT-4o.

Together Lite endpoints use INT4 quantization to offer high-quality AI models at a lower cost, priced at $0.10 per million tokens for Llama 3 8B Lite, which is six times lower than GPT-4o-mini.

Adoption and Endorsements

Over 100,000 developers and companies, including Zomato, DuckDuckGo, and the Washington Post, are already utilizing the Together Inference Engine for their Generative AI applications. Rinshul Chandra, COO of Food Delivery at Zomato, praised the engine for its high quality, speed, and accuracy.

Technical Innovations

The Together Inference Engine 2.0 incorporates several technical advancements, including FlashAttention-3, custom-built speculators, and quality-preserving quantization techniques. These innovations contribute to the engine’s superior performance and cost-efficiency.

Future Outlook

Together AI plans to continue pushing the boundaries of AI acceleration. The company aims to extend support for new models, techniques, and kernels, ensuring the Together Inference Engine remains at the forefront of AI technology.

The Turbo and Lite endpoints for Llama 3 models are available starting today, with plans to expand to other models soon. For more information, visit the Together AI pricing page.

Image source: Shutterstock


Credit: Source link

RELATED POSTS

Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

Exploring Chainlink’s Role Beyond Price Feeds in the Blockchain Ecosystem

Buy JNews
ADVERTISEMENT
ShareTweetSendPinShare
Previous Post

XRP Anticipates 59% Price Surge

Next Post

LangChain Enhances Core Tool Interfaces and Documentation

Related Posts

Bitcoin Addresses Holding Between 100 and 10,000 BTC Hit a 7-Week High
Blockchain

Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

April 10, 2026
Riot Blockchain Yearly Bitcoin Production Increases by 236%, Accumulates $194M in BTC
Blockchain

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

April 2, 2026
Galaxy Digital: Ethereum Developers Discuss Key Upgrades During Latest Consensus Call
Blockchain

Exploring Chainlink’s Role Beyond Price Feeds in the Blockchain Ecosystem

December 9, 2025
Next Post
LangChain Introduces Self-Improving Evaluators for LLM-as-a-Judge

LangChain Enhances Core Tool Interfaces and Documentation

Bankrupt Crypto Lender BlockFi to Commence Repayments This Month 

Bankrupt Crypto Lender BlockFi to Commence Repayments This Month 

Recommended Stories

SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News

SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News

April 11, 2026
Ripple CEO Says CLARITY Act Talks Near Breakthrough as Senate Standoff Eases

Ripple CEO Says CLARITY Act Talks Near Breakthrough as Senate Standoff Eases

April 14, 2026

Popular Stories

  • Winklevoss Twins Continue Crypto Donation Spree With Another $1,000,000 in Bitcoin (BTC)

    Trader Says DeFi Altcoin Aave Witnessing Clear Trend Switch, Updates Forecast on Two Low-Cap Coins

    0 shares
    Share 0 Tweet 0
  • Kraken’s Jesse Powell Warns of Looming Government Crackdown on Bitcoin and Crypto Assets

    0 shares
    Share 0 Tweet 0
  • To Avoid a Global Recession the Fed Should Ease Interest Rate Hikes – UN Report

    0 shares
    Share 0 Tweet 0
  • Over $1,260,000,000 Stolen From Ethereum-Dominated Crypto Sector in Q1 This Year: FBI

    0 shares
    Share 0 Tweet 0
  • Analyst Says Speculators and Bitcoin Miners Responsible for BTC’s Recent Plunge Below $60,000

    0 shares
    Share 0 Tweet 0
CryptoSpiel.com

This is an online news portal that aims to provide the latest crypto news, blockchain, regulations and much more stuff like that around the world. Feel free to get in touch with us!

What’s New Here!

  • Ripple CEO Says CLARITY Act Talks Near Breakthrough as Senate Standoff Eases
  • SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News
  • Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

Subscribe Now

Loading
  • Live Crypto Prices
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 - cryptospiel.com - All rights reserved!

No Result
View All Result
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams

© 2021 - cryptospiel.com - All rights reserved!

Please enter CoinGecko Free Api Key to get this plugin works.