CryptoSpiel.com
No Result
View All Result
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams
No Result
View All Result
CryptoSpiel.com
No Result
View All Result

Dragonfly: Enhanced Vision-Language Model with Multi-Resolution Zoom Launched by Together.ai

June 10, 2024
in Blockchain
Reading Time: 3 mins read
A A
0
Nvidia Plans to add Innovation in the Metaverse with Software, Marketplace Deals
0
SHARES
8
VIEWS
ShareShareShareShareShare





Together.ai has announced the launch of Dragonfly, an innovative vision-language model designed to enhance fine-grained visual understanding and reasoning about image regions. The architecture leverages multi-resolution zoom-and-select capabilities to optimize multi-modal reasoning while maintaining context efficiency, according to Together AI.

RELATED POSTS

Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

Exploring Chainlink’s Role Beyond Price Feeds in the Blockchain Ecosystem

Dragonfly Model Architecture

Dragonfly employs two primary strategies: multi-resolution visual encoding and zoom-in patch selection. These techniques enable the model to focus on fine-grained details of image regions, enhancing its commonsense reasoning capabilities. The architecture processes images at multiple resolutions—low, medium, and high—dividing each image into sub-images that are encoded into visual tokens. These tokens are then projected into a language space, forming a concatenated sequence that feeds into the language model.

Zoom-in Patch Selection: Dragonfly employs a selective approach for high-resolution images, identifying and retaining only the sub-images that provide the most significant visual information. This targeted selection reduces redundancy and improves the overall model efficiency.

Performance and Evaluation

Dragonfly demonstrates promising performance on several vision-language benchmarks, including commonsense visual question answering and image captioning. The model achieved competitive results on benchmarks such as AI2D, ScienceQA, MMMU, MMVet, and POPE, showcasing its effectiveness in fine-grained understanding of image regions.

Benchmark Performance:









Model AI2D ScienceQA MMMU MMVet POPE
VILA – 68.2   34.9 85.5
LLaVA-v1.5 (Vicuna-7B) 54.8 70.4 35.3 30.5 85.9
LLaVA-v1.6 (Mistral-7B) 60.8 72.8 33.4 44.8 86.7
QWEN-VL-chat 52.3 68.2 35.9 – –
Dragonfly (LLaMA-8B) 63.6 80.5 37.8 35.9 91.2

Dragonfly-Med

In collaboration with Stanford Medicine, Together.ai has also introduced Dragonfly-Med, a version fine-tuned on 1.4 million biomedical image-instruction data. This model excels in high-resolution medical data tasks, outperforming previous models like Med-Gemini on multiple medical imaging benchmarks.

Evaluation on Medical Benchmarks

Dragonfly-Med was evaluated on visual question-answering and clinical report generation tasks, achieving state-of-the-art results on several benchmarks:

Buy JNews
ADVERTISEMENT







Dataset Metric Med-Gemini Dragonfly-Med (LLaMA-8B)
VQA-RAD Acc (closed) 69.7 77.4
SLAKE Acc (closed) 84.8 90.4
Path-VQA Acc (closed) 83.3 92.3

Conclusion and Future Work

Dragonfly’s architecture offers a new research direction by focusing on zooming in on image regions to capture more fine-grained visual information. Together.ai plans to continue improving the model’s capabilities and exploring new architectures and visual encoding strategies to benefit broader scientific fields.

The collaboration with Stanford Medicine and the utilization of resources like Meta LLaMA3 and CLIP from OpenAI have been crucial in developing Dragonfly. The model’s codebase also builds upon the foundations of Otter and LLaVA-UHD.

Image source: Shutterstock

. . .

Tags


Credit: Source link

ShareTweetSendPinShare
Previous Post

Bitcoin Dominance Rises as Binance Coin and Other Alts Turn Red (Market Watch)

Next Post

Economist Henrik Zeberg Says Altcoins Set To Go ‘Flying’ in Blow-Off Top Style Euphoric Bull Run

Related Posts

Bitcoin Addresses Holding Between 100 and 10,000 BTC Hit a 7-Week High
Blockchain

Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

April 10, 2026
Riot Blockchain Yearly Bitcoin Production Increases by 236%, Accumulates $194M in BTC
Blockchain

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

April 2, 2026
Galaxy Digital: Ethereum Developers Discuss Key Upgrades During Latest Consensus Call
Blockchain

Exploring Chainlink’s Role Beyond Price Feeds in the Blockchain Ecosystem

December 9, 2025
Next Post
Altcoin Markets Gearing Up for Q1 Hype Cycle Amid Strong Performance of Ethereum-Based Altcoin, Says Analyst

Economist Henrik Zeberg Says Altcoins Set To Go ‘Flying’ in Blow-Off Top Style Euphoric Bull Run

Exploring AssemblyAI’s Integrations: Enhancing Speech AI Workflows

Exploring AssemblyAI’s Integrations: Enhancing Speech AI Workflows

Recommended Stories

Can US-Iran new peace deal signal keep Bitcoin above $70,000?

Can US-Iran new peace deal signal keep Bitcoin above $70,000?

April 8, 2026
SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News

SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News

April 11, 2026
Treasury Proposes Stablecoin AML Rules as Bessent Vows to Protect US Financial System – Crypto News Bitcoin News

Treasury Proposes Stablecoin AML Rules as Bessent Vows to Protect US Financial System – Crypto News Bitcoin News

April 8, 2026

Popular Stories

  • Winklevoss Twins Continue Crypto Donation Spree With Another $1,000,000 in Bitcoin (BTC)

    Trader Says DeFi Altcoin Aave Witnessing Clear Trend Switch, Updates Forecast on Two Low-Cap Coins

    0 shares
    Share 0 Tweet 0
  • Huobi to Discontinue Cloud Wallet Service in May 2023

    0 shares
    Share 0 Tweet 0
  • Bitcoin Rejected at $29K, Arbitrum’s ARB Dumps 20% Daily: Weekend Watch

    0 shares
    Share 0 Tweet 0
  • FTX and Entertainment Giant Dolphin to Launch NFT Marketplace – Bitcoin News

    0 shares
    Share 0 Tweet 0
  • Privacy Is Key for Successful Digital Euro, Data Protection Body Says – Regulation Bitcoin News

    0 shares
    Share 0 Tweet 0
CryptoSpiel.com

This is an online news portal that aims to provide the latest crypto news, blockchain, regulations and much more stuff like that around the world. Feel free to get in touch with us!

What’s New Here!

  • Ripple CEO Says CLARITY Act Talks Near Breakthrough as Senate Standoff Eases
  • SEC Opens Proceedings on NYSE Proposal to List Grayscale Crypto ETF Options – Regulation Bitcoin News
  • Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development

Subscribe Now

Loading
  • Live Crypto Prices
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 - cryptospiel.com - All rights reserved!

No Result
View All Result
  • Home
  • Live Crypto Prices
  • Live ICO
  • Exchange
  • Crypto News
  • Bitcoin
  • Altcoins
  • Blockchain
  • Regulations
  • Trading
  • Scams

© 2021 - cryptospiel.com - All rights reserved!

Please enter CoinGecko Free Api Key to get this plugin works.