Meta发布统一多模态音频分离模型SAM Audio

DoNews
Dec 17

2025年12月17日,Meta发布了首个统一的多模态音频分离模型SAM Audio,可通过文本、视觉或时间段提示从复杂音频中分离特定声音。该模型基于感知编码器视听(PE-AV)技术,支持点击视频中的物体、输入文本指令或标记时间范围来提取目标音频,如点击吉他分离其演奏声,或过滤播客中的狗叫噪音。Meta同时推出评估基准SAM Audio-Bench与自动评测模型SAM Audio Judge,并已...

Source Link

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Most Discussed

  1. 1
     
     
     
     
  2. 2
     
     
     
     
  3. 3
     
     
     
     
  4. 4
     
     
     
     
  5. 5
     
     
     
     
  6. 6
     
     
     
     
  7. 7
     
     
     
     
  8. 8
     
     
     
     
  9. 9
     
     
     
     
  10. 10