to - 钛刻 - 科技风向旗 - 深度刻画技术趋势,引领数字未来 - 第64页 - 钛刻科技

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

tech www.v2ex.com 2026-04-26 00:07:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-26 00:07:34+08:00

你们一个月上班要用多少 token?

如题，我上个月用了 200M ，主要是 GPT5.4 和 Gemini3.1Pro

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-26 00:00:55+08:00

TikTok、Visa 联手在英国推出借记卡：网红想提现收入不用再等上好几天了

IT之家 4 月 25 日消息，当地时间 4 月 24 日，据英国《卫报》报道，TikTok 和 Visa 在英国推出了一张专门面向内容创作者的借记卡，主打让创作者更快拿到平台收入。这张创作者卡主要服务于通过 TikTok Live 直播服务赚钱的人。创作者可以在直播中

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.ithome.com 2026-04-25 23:52:20+08:00

小游戏出海

大家好，最近在研究小游戏出海 tiktok ，目前海外主体在资质审核阶段。希望加入相关的社群，或者我来创建一个组织。有希望了解这块儿的或者已经在从事的可以留微我加你，或者你拉我。感谢大家。base64：bHNsYmxvZw==

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 23:47:45+08:00

小游戏出海

大家好，最近在研究小游戏出海 tiktok ，目前海外主体在资质审核阶段。希望加入相关的社群，或者我来创建一个组织。有希望了解这块儿的或者已经在从事的可以留微我加你，或者你拉我。感谢大家。base64：bHNsYmxvZw==

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 23:47:45+08:00

你们一个月上班要用多少 token?

如题，我上个月用了 200M ，主要是 GPT5.4 和 Gemini3.1Pro

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 23:46:16+08:00

日产 CEO 埃斯皮诺萨：CVT 变速箱仍有意义，但不会用在性能车上

IT之家 4 月 25 日消息，据外媒 motor1 今日报道，日产没少被 CVT 变速箱困扰。2025 年，日产就与车主达成了一起集体诉讼和解，向 Maxima 和 Murano（直译为“西玛”“楼兰”）车主支付 300 万美元（IT之家注：现汇率约合 2054.1 万元人民币

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.ithome.com 2026-04-25 23:24:24+08:00

AMD 锐龙 9 9950X3D2 评测遇冷，但首发仍冲入亚马逊热销 Top10

IT之家 4 月 25 日消息，AMD 首款双芯 3D V-Cache 处理器锐龙 9 9950X3D2 本周三正式开售，但因为 AMD 故意未向多个知名评测机构提供样片而遭受质疑，同时其他媒体给出的评测也普遍不乐观。简单来说，9950X3D2 游戏性能提升微乎其微，它在渲染、

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.ithome.com 2026-04-25 23:19:24+08:00

为了买到便宜靠谱的 Token，少被割韭菜，我做了个比价的网站

最近在开发一个小项目，想买几个 ChatGPT 账号。但是在多个卡网和电报群里转了几圈，发现同一种类型的账号，比如 ChatGPT plus 月卡，就有 N 多种价格。有的几块钱，有的卖 15 ，有的卖 30 ，有的卖 40. 后来我才知道，这些店铺和卖家的来源其实就是那么几

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 23:17:27+08:00

为了买到便宜靠谱的 Token，少被割韭菜，我做了个比价的网站

最近在开发一个小项目，想买几个 ChatGPT 账号。但是在多个卡网和电报群里转了几圈，发现同一种类型的账号，比如 ChatGPT plus 月卡，就有 N 多种价格。有的几块钱，有的卖 15 ，有的卖 30 ，有的卖 40. 后来我才知道，这些店铺和卖家的来源其实就是那么几

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 23:06:09+08:00

5.4降智的没法用,今天转头用5.5还算不错.OAI 开新模型老模型直接不给算力是吧?5Etoken今日使用

用户明确指出工具来源了。我先按开发要求用 tool search 查出这个延迟工具的准确名字和调用方式，再继续 csv-todo-loop。如果能拿到 codebase-retrieval，就从 A1 开始真正施工。以前从没有这样过. 睁眼说瞎说! 今天使用了 2E 5.4 2

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech linux.do 2026-04-25 22:51:48+08:00

你们一个月上班要用多少 token?

如题，我上个月用了 200M ，主要是 GPT5.4 和 Gemini3.1Pro

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:46:20+08:00

你们一个月上班要用多少 token?

如题，我上个月用了 200M ，主要是 GPT5.4 和 Gemini3.1Pro

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:33:55+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:29:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:29:34+08:00

[程序员] 你们一个月上班要用多少 token?

如题，我上个月用了 200M ，主要是 GPT5.4 和 Gemini3.1Pro

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech v2ex.com 2026-04-25 22:22:06+08:00

为了买到便宜靠谱的 Token，少被割韭菜，我做了个比价的网站

最近在开发一个小项目，想买几个 ChatGPT 账号。但是在多个卡网和电报群里转了几圈，发现同一种类型的账号，比如 ChatGPT plus 月卡，就有 N 多种价格。有的几块钱，有的卖 15 ，有的卖 30 ，有的卖 40. 后来我才知道，这些店铺和卖家的来源其实就是那么几

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:18:20+08:00

hermes 我真的力竭了！

今天和hermes对线了半个小时起因是我的server的key我忘了记在哪里了，图省事让hermes告诉我，他告诉我是autoteammasterkey_2026，我尝试了无数次也让他验证了无数次，他始终告诉我就是这个key，可我就是登录不上啊最终我找到了我的笔记，是auto

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech linux.do 2026-04-25 22:06:46+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Tool Community Social Account Reminder 专题内容 Vacation Client Upload Business Conversion Metric Rating Loya...Alert Comment Template Hosting App 专题内容 Premium Module Course 专题内容 Mobile Growth Milestone Analysis Profit Recipe AI About Incom...Quality Shopping 专题内容 Deal Prospect Networking Vendor Fashion Workshop Campaign Lab...Hotel Help Automation Device Expensive Notification Engagemen...Investment Tracking 专题内容 Website Target 专题内容 Software Media Resource 专题内容 Behavior Growth Ranking Analytics Login 专题内容 Layout Mobile Image 专题内容 Data About Support 专题内容 Blog Entertainment Audience Growth Consulting Network Alert 专题内容 Logo Fashion Event Saving Subscribe Automation Objective 专题内容 Coupon 专题内容 Link Premium Saving Customization Planning Study Download 财经...Segment Plugin Restore Machine Alert 专题内容 Budget Efficiency Performance Integration Sync Landing Lesson...

tech www.v2ex.com 2026-04-25 22:05:33+08:00

to - 钛刻 - 科技风向旗 - 深度刻画技术趋势,引领数字未来 - 第64页 - 钛刻科技 | TCTI.cn

相关标签