CryptoSpiel.com

DeepSWE: Revolutionizing Coding Agents with Open-Source Reinforcement Learning

July 2, 2025
in Blockchain


Luisa Crawford
Jul 02, 2025 17:58

DeepSWE-Preview, an advanced coding agent, sets new benchmarks in open-source AI with a 59% success rate on SWE-Bench-Verified, showcasing state-of-the-art performance using reinforcement learning.





DeepSWE-Preview is a new open-source coding agent developed jointly by the Agentica team and Together AI. Trained with reinforcement learning (RL), it reaches a 59% pass rate on the SWE-Bench-Verified benchmark when test-time scaling is applied, according to Together AI.

Revolutionizing Software Engineering

DeepSWE-Preview is built on the Qwen3-32B model and trained using only RL. Without test-time scaling, the agent reaches a Pass@1 rate of 42.2% and a Pass@16 rate of 71.0%, outperforming other open-weight coding agents. The model was trained for six days on 64 H100 GPUs, working through 4,500 real-world software engineering tasks drawn from the R2E-Gym training environments.
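
For readers unfamiliar with the Pass@k figures quoted above, the following is a minimal sketch of the unbiased Pass@k estimator commonly used in code-generation evaluations. The article does not say how DeepSWE-Preview's numbers were computed, so treating them as this estimator is an assumption, and the function names `pass_at_k` and `mean_pass_at_k` are illustrative only.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate the chance that at least one of k sampled attempts succeeds.

    n: attempts made on one task, c: attempts that succeeded,
    k: sampling budget being evaluated (1 <= k <= n).
    """
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and n")
    # Fewer than k failures exist, so every size-k sample contains a success.
    if n - c < k:
        return 1.0
    # 1 minus the probability that all k sampled attempts are failures.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average Pass@k over tasks, given (n, c) pairs per task."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

Under this definition, Pass@1 is the average single-attempt success rate, while Pass@16 counts a task as solved if any of 16 attempts succeeds, which is why the second figure is much higher than the first.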

Harnessing the Power of rLLM

DeepSWE-Preview was trained with rLLM, Agentica’s framework for post-training language agents. The dataset, code, and training logs are being open-sourced alongside it, so that others can scale and improve agents using RL. The full training recipe for turning a 32B model into a coding agent is publicly available.

Emerging Behaviors and Performance

During training, DeepSWE-Preview developed emergent behaviors such as anticipating edge cases and running thorough regression tests. These behaviors matter for complex software engineering tasks, which require navigating large codebases and keeping changes compatible with existing functionality.

Test-Time Scaling and Further Developments

DeepSWE-Preview uses test-time scaling (TTS) to improve its performance: it generates multiple candidate solutions and selects among them using a combination of execution-free and execution-based verification. This hybrid strategy raises its SWE-Bench-Verified score from the 42.2% Pass@1 baseline to 59%. Future research aims to explore larger models and extend the approach to other domains, including web agents.
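
To make the idea of hybrid verification concrete, here is a minimal sketch of one way to pick a final patch from several candidates. It is not DeepSWE-Preview's actual selection procedure, which the article does not describe. The `Candidate` fields, the `select_patch` function, and the rule of ranking by test results first and by a learned verifier score second are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    patch_id: str
    tests_passed: int      # execution-based signal: tests passed after applying the patch
    tests_total: int
    verifier_score: float  # execution-free signal: hypothetical model score in [0, 1]


def execution_score(candidate: Candidate) -> float:
    """Fraction of tests passed; 0.0 when no tests were run."""
    if candidate.tests_total == 0:
        return 0.0
    return candidate.tests_passed / candidate.tests_total


def select_patch(candidates: list[Candidate]) -> Candidate:
    """Keep the candidates with the best test results, then break ties
    with the execution-free verifier score."""
    if not candidates:
        raise ValueError("at least one candidate is required")
    best_exec = max(execution_score(c) for c in candidates)
    finalists = [c for c in candidates if execution_score(c) == best_exec]
    return max(finalists, key=lambda c: c.verifier_score)


# Hypothetical candidates for a single task.
candidates = [
    Candidate("a", tests_passed=3, tests_total=4, verifier_score=0.9),
    Candidate("b", tests_passed=4, tests_total=4, verifier_score=0.2),
    Candidate("c", tests_passed=4, tests_total=4, verifier_score=0.7),
]
chosen = select_patch(candidates)
```

In this sketch the execution-based signal acts as a filter and the execution-free signal as a tie-breaker; other combinations, such as a weighted sum of the two scores, would be equally plausible.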

DeepSWE-Preview represents a pivotal step in democratizing AI development, showcasing the potential of reinforcement learning to tackle long-horizon, multi-step challenges in software engineering. With its open-source nature, it invites the global research community to contribute to and build upon its successes.
