本地 - 钛刻 - 科技风向旗 - 深度刻画技术趋势,引领数字未来 - 第20页 - 钛刻科技

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

tech www.v2ex.com 2026-04-26 01:07:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-26 00:07:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-26 00:07:34+08:00

让网页端的 deepseek 变成猫娘帮你做事吧～

新增提示词注入功能 [deepseek 网页增强脚本 V2] 支持网页端调用本地工具！资源荟萃本帖使用社区开源推广，符合推广要求。我申明并遵循社区要求的以下内容：我的帖子已经打上开源推广标签：是我的开源项目完整开源，无未开源部分：是我的开源项目已链接认可 LIN

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech linux.do 2026-04-25 22:45:37+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 22:29:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 22:29:34+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 22:05:33+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 22:05:33+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 21:45:41+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 21:45:41+08:00

[deepseek 网页增强脚本 V2] 支持网页端调用本地工具！

本帖使用社区开源推广，符合推广要求。我申明并遵循社区要求的以下内容：我的帖子已经打上开源推广标签：是我的开源项目完整开源，无未开源部分：是我的开源项目已链接认可 LINUX DO 社区：是我帖子内的项目介绍，AI生成、润色内容部分已截图发出：是以上选择我承诺

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech linux.do 2026-04-25 21:13:35+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:44:05+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:44:05+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:44:05+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:44:05+08:00

真是赶上好时候了，不过也不知道算不算是好时候吧

网页端几十分钟就能生成这26页的PPT（原生可编辑），本地agent很快就能生成双栏8页的ieee trans期刊的排版，虽然说肯定是比较粗糙的，但是感觉确实对效率的提高很大 3 个帖子 - 3 位参与者阅读完整话题

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech linux.do 2026-04-25 20:33:43+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:20:13+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 20:06:01+08:00

【低价宽带】本地帮派太有礼貌了

父母家装宽带，其实用量不大的，能刷视频就行。高强度网上冲浪了一下午，电商咸鱼xhs都问了一圈，机构最少都480一年300m。受站内一个帖子启发，直接找安装师傅问价，果然本地师傅就是有路子。360包一年500m家宽和一张流量卡整年月租，免押光猫和WiFi6路由器，还免了安装费，服

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech linux.do 2026-04-25 19:05:45+08:00

我做了个工具让 8GB 显卡跑 30B 模型从 3 tok/s 提到 21 tok/s，记录一下技术发现

最近在折腾本地大模型，发现一个核心问题：Ollama 和 LM Studio 能让模型跑起来，但参数全靠猜——上下文长度、KV cache 类型、MoE expert 放哪、ubatch 多大……用默认参数基本是在浪费显卡。于是做了个工具自动找最优配置，过程中踩了不少坑，记录一

相关专题

Terms Restore Download Alert Template Community Objective 专题内容 Profit Global Investment Success 专题内容 Platform 专题内容 Value Customer Seminar Conversion Traffic 专题内容 Desktop 专题内容 Reminder Revenue Strategy Desktop Review Funnel Progress Trav...Customer Dashboard Tutorial Personalization Solution 专题内容 Rating Collaboration Partner Communication 专题内容 Logo Luxury Research Help Discovery Unsubscribe Brand 专题内容 Value Plugin Technology Networking Notification 影视 Fashion 视频...Identity Settings Tracking 专题内容 Campaign Policy 视频 Growth Machine Metric Fitness Task 专题内容 Progress AI Ebook Project Planning Engagement Policy User Exp...Settings Sales Music Affordable Project 专题内容 Event 专题内容 Case Site Spreadsheet 专题内容 Image Study Document Analysis Audience 专题内容 Promotion 专题内容 Digital Market 专题内容 Optimization API Food Analytics Tactic Design Partner 专题内容

tech www.v2ex.com 2026-04-25 19:04:06+08:00

本地 - 钛刻 - 科技风向旗 - 深度刻画技术趋势,引领数字未来 - 第20页 - 钛刻科技 | TCTI.cn

相关标签