Are there any reasonably authoritative and impartial platforms for evaluating LLM capabilities?
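LLM benchmark scores these days may well be inflated: vendors train specifically on the test sets in advance, or submit specially tuned models just to climb the leaderboards. There are also all sorts of unofficial leaderboards, as well as commercial platforms where a first-place ranking can be bought, which makes it hard to tell what to trust. Is there a widely recognized evaluation platform that is reasonably fair and non-commercial?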