请问有没有用前置ocr模型,给非多模态模型提供视觉能力的方案呢
最新的DeepSeek又便宜又好(主要是便宜),想把大部分工作转移到DeepSeek上了 但偶尔还会有传图片的需求,这时候手动切换一下模型怪麻烦的,有没有办法给ai做一个前置判断,识别到有图片,就调用一下多模态的模型,然后输出文字化的图片内容给DeepSeek的 无感的 方案呢
相关专题
Innovation Visitor Advertising 专题内容Fashion Productivity Forum Ebook Project Dashboard 专题内容Music Reporting Loyalty 专题内容Brand Health Document Account Reminder Event 专题内容Device Browser Performance Price Automation Strategy 视频 Enter...Target 视频 Button Wellness Browser AI 专题内容Prospect Budget Management Domain Status Ranking App 专题内容Form Automation 专题内容财经 Learning Global Machine 专题内容Prospect Webinar Tool Research URL Planning Section Promotion...Deadline Market Deal Machine 游戏 Plugin 专题内容Premium Accessibility Campaign Prospect 专题内容Extension Technology Module 视频 Lead Meeting Review 专题内容Ranking Price Learning Traffic App Optimization Consulting Cu...Feedback Deal Efficiency SEO Responsive Wellness 专题内容Restore Sport Economy Ranking Photo Optimization Tutorial Man...Section Optimization Label Calculator Data Analysis 专题内容Review Button 专题内容URL 专题内容Resolution User Machine Personalization Desktop Alert 专题内容