new-api

Author	SHA1	Message	Date
CaIon	fddf54ccc5	perf: reduce heap residency for large base64 relay requests Three layered optimizations targeting Gemini-style 5MB base64 payloads where RSS could balloon to tens of GB under concurrent load: 1. Byte-based param override (relay/common/override.go) - Switch legacy/operations hot paths from common.Marshal round-trips and map[string]any conversions to gjson/sjson on []byte directly. - Avoids cloning 5MB strings during each Set/Delete operation. 2. strings.Builder for Gemini response markdown (relay/channel/gemini/relay-gemini.go) - Replace string concatenation + strings.Join when assembling "![image](data:...;base64,DATA)" content for inline image responses. - Pre-allocates capacity from inline_data byte sizes. 3. Outbound BodyStorage + streaming Decoder (this commit's core) - New relay/common/outbound_body.go helper wraps marshaled upstream bodies in common.BodyStorage, allowing disk-cache mode to offload jsonData to a temp file while waiting for upstream TTFB. The original []byte can then be GC'd, removing ~5MB/req of heap residency during the longest window of a request. - All 7 relay handlers (gemini/claude/responses/embedding/image/compatible/ rerank) plus chat_completions_via_responses adopt the helper with defer closer.Close() and explicit jsonData = nil. - relay/common/relay_info.go: new UpstreamRequestBodySize so relay/channel/api_request.go can populate req.ContentLength (lost when body becomes a type-erased io.Reader). - common/gin.go UnmarshalBodyReusable: when storage is disk-backed and content-type is JSON, decode via DecodeJson(storage) instead of storage.Bytes()+Unmarshal, removing one transient 5MB copy per request. memory mode and form/multipart paths unchanged.	2026-05-22 19:08:38 +08:00
Seefs	0936e25046	perf: avoid eager formatting in debug log calls (#4929 )	2026-05-19 12:11:24 +08:00
heimoshuiyu	8ca103342d	fix: Message.ReasoningContent/Reasoning 改为 string，修复空思考内容在请求转发时被静默丢弃的问题问题：在非 passThrough 模式下，客户端发送的 reasoning_content: "" 经过 Go struct 反序列化再序列化后，因 string + omitempty 无法区分空串和字段缺失，导致空的思考内容被静默丢弃。根因： dto.Message.ReasoningContent 和 Message.Reasoning 使用 string（非指针）加 omitempty，违反 AGENTS.md Rule 6（可选标量字段必须用指针类型）。修复： 1. Message.ReasoningContent/Reasoning 类型从 string 改为 string - nil = 字段缺失 → JSON 省略 - &"" = 显式空串 → JSON 保留 reasoning_content: "" 2. 新增 Message.GetReasoningContent() 辅助方法 3. 更新所有读写处：relay-openai, relay-claude, relay-gemini, ollama 4. 新增测试覆盖空串保留、字段省略、getter 回退逻辑	2026-04-29 13:43:26 +08:00
CaIon	eab478bdc8	fix: miscellaneous quick fixes from CodeRabbit review - log_info_generate.go: add nil guard in InjectTieredBillingInfo - billing_expr_request.go: merge headers instead of replacing - go.mod: remove incorrect // indirect on expr-lang/expr - ToolPriceSettings.jsx: add null check in syncToVisual - tool_billing.go: fix PricePer1K for image_generation (per-call, not per-1K) - utils.jsx: add minute() to time condition regex - useUsageLogsData.jsx: pass displayMode to renderTieredModelPrice - AGENTS.md, CLAUDE.md: fix Rule 6/7 ordering - relay-gemini.go: add TEXT modality case in CandidatesTokensDetails	2026-04-24 00:34:06 +08:00
CaIon	4d2993e4cc	Merge remote-tracking branch 'origin/main' into nightly Some checks failed Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details # Conflicts: # web/src/helpers/render.jsx # web/src/hooks/usage-logs/useUsageLogsData.jsx # web/src/i18n/locales/en.json	2026-04-09 17:12:21 +08:00
CaIon	03758a4a85	refactor(file-source): unify file source creation and enhance caching mechanisms	2026-04-06 15:54:55 +08:00
RedwindA	79527c0ab1	feat: add HEIC/HEIF image format support Add detection, MIME type mapping, and dimension parsing for HEIC/HEIF images via ISOBMFF ftyp brand inspection and ispe box parsing. Update Gemini relay to accept these formats and refactor getImageConfig to properly retry decoders using buffered data.	2026-04-02 16:40:45 +08:00
CaIon	35d0704640	Merge branch 'origin/main' into nightly Resolve 4 conflicts: - relay/compatible_handler.go: accept main's refactor (postConsumeQuota -> service.PostTextConsumeQuota) - service/quota.go: accept main's PostClaudeConsumeQuota deletion, keep nightly's tiered billing in PostWssConsumeQuota and PostAudioConsumeQuota - web/src/i18n/locales/{en,zh-CN}.json: merge both sets of translation keys Post-merge integration: - Add tiered billing (TryTieredSettle, InjectTieredBillingInfo) to PostTextConsumeQuota - Update tool pricing calls to use nightly's generic GetToolPriceForModel/GetToolPrice API	2026-04-02 00:39:13 +08:00
CaIon	5238f279db	feat: record stream interruption reasons via StreamStatus Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details Build Electron App / build (windows-latest) (push) Has been cancelled Details Build Electron App / release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details - Add StreamStatus type (relay/common) to track stream end reason (done/timeout/client_gone/scanner_error/eof/panic/ping_fail) and accumulate soft errors during streaming via sync.Once + sync.Mutex. - Add StreamResult (relay/helper) as the callback interface: adapters call sr.Error() for soft errors, sr.Stop() for fatal, sr.Done() for normal completion. No early-return problem — multiple errors per chunk are naturally supported. - Refactor StreamScannerHandler callback from func(string) bool to func(string, *StreamResult). All 9 channel adapters updated. - Write stream_status into log other JSON field (admin-only) with status ok/error, end_reason, error_count, and error messages. - Frontend: display stream status in log detail expansion for admins.	2026-03-31 16:54:39 +08:00
CaIon	c5405b2a12	feat: add billing expression system documentation and enhance tiered billing logic - Introduced a new rule for the Billing Expression System, emphasizing the importance of reading `pkg/billingexpr/expr.md` for dynamic billing. - Updated the billing expression logic to support new variables and improved handling of image and audio tokens. - Enhanced the tiered billing functionality with versioning support for expressions and refined quota calculations. - Added tests to validate the new billing expression features and ensure correctness in pricing calculations.	2026-03-17 16:59:25 +08:00
Seefs	2cf3c1836c	fix: preserve explicit zero values in native relay requests	2026-03-01 15:47:03 +08:00
Seefs	c97f4524f2	fix: unify usage mapping and include toolUsePromptTokenCount in input tokens	2026-02-17 15:45:14 +08:00
CaIon	92aca9771f	feat: refactor extra_body handling for improved configuration parsing	2026-02-11 22:15:22 +08:00
Seefs	99928bcfde	fix: charge local input tokens when Gemini returns empty response	2026-02-05 15:57:17 +08:00
CaIon	ffef331192	refactor(gemini): remove GeminiVisionMaxImageNum constant and related image count logic	2026-02-04 19:10:06 +08:00
CaIon	9ef9e78821	feat(file): unify file handling with a new FileSource abstraction for URL and base64 data	2026-02-04 18:23:17 +08:00
thirking	4108c404ee	fix: remove unnecessary unescapeMapOrSlice call in Gemini relay Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details Build Electron App / build (windows-latest) (push) Has been cancelled Details Build Electron App / release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details The JSON serialization/deserialization already handles escape characters correctly, so the unescapeMapOrSlice function is redundant.	2026-02-03 11:47:45 +08:00
RedwindA	3985d10ae1	feat(gemini): support cached token billing	2026-02-01 22:50:47 +08:00
RedwindA	9899073ecb	feat(gemini): map OpenAI stop to Gemini stopSequences	2026-01-29 21:30:27 +08:00
Seefs	68e1e635e9	feat: logs show reject reason	2026-01-25 14:52:18 +08:00
Seefs	7af4d07843	fix: Charge locally even if there's an error	2026-01-25 14:32:51 +08:00
Seefs	c09c272574	Merge branch 'upstream-main' into fix/pr-2540 # Conflicts: # relay/channel/gemini/relay-gemini.go	2026-01-25 14:14:05 +08:00
Calcium-Ion	3f208ee365	Merge pull request #2701 from seefs001/fix/gemini-tool-call-index fix: calls to multiple tools in gemini all return index=0	2026-01-21 23:47:48 +08:00
Seefs	7e90c832e2	fix: issue where consecutive calls to multiple tools in gemini all returned an index of 0	2026-01-20 22:03:19 +08:00
Seefs	d694a197d2	fix: openAI function to gemini function field adjusted to whitelist mode	2026-01-15 13:26:42 +08:00
Seefs	14b3dac82c	fix: clean propertyNames for gemini function	2026-01-11 23:34:18 +08:00
RedwindA	db96248c5b	refactor(gemini): 更新 GeminiModelsResponse 以使用 dto.GeminiModel 类型	2026-01-09 18:08:11 +08:00
RedwindA	e8eea5d3ee	fix(gemini): fetch model list via native v1beta/models endpoint Use the native Gemini Models API (/v1beta/models) instead of the OpenAI-compatible path when listing models for Gemini channels, improving compatibility with third-party Gemini-format providers that don't implement OpenAI routes. - Add paginated model listing with timeout and optional proxy support - Select an enabled key for multi-key Gemini channels	2026-01-09 18:00:40 +08:00
Xyfacai	eeccb2146f	fix: 修复 gemini 文件类型不支持 image/jpg	2026-01-04 22:09:03 +08:00
Your Name	9c243d1fb8	feat(gemini): 支持 tool_choice 参数转换，优化错误处理	2025-12-27 18:33:09 +08:00
RedwindA	518563c7eb	feat: map OpenAI developer role to Gemini system instructions	2025-12-27 02:52:33 +08:00
Seefs	39df47486c	fix(gemini): handle minimal reasoning effort budget - Add minimal case to clampThinkingBudgetByEffort to avoid defaulting to full thinking budget	2025-12-18 08:10:46 +08:00
Calcium-Ion	4c54836a53	Merge pull request #2344 from seefs001/feature/gemini-thinking-level Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details feat: gemini 3 thinking level gemini-3-pro-preview-high	2025-12-02 21:55:43 +08:00
CaIon	1fededceb3	feat: refactor token estimation logic - Introduced new OpenAI text models in `common/model.go`. - Added `IsOpenAITextModel` function to check for OpenAI text models. - Refactored token estimation methods across various channels to use estimated prompt tokens instead of direct prompt token counts. - Updated related functions and structures to accommodate the new token estimation approach, enhancing overall token management.	2025-12-02 21:34:39 +08:00
CaIon	e19e9ad2fa	feat(gemini): implement markdown image handling in text processing	2025-12-01 17:54:41 +08:00
Seefs	607f7305b7	feat: gemini 3 thinking level gemini-3-pro-preview-high	2025-12-01 16:40:46 +08:00
CaIon	5d05cd9d32	feat(gemini): add validation and conversion for imageConfig parameters in extra_body	2025-11-30 19:31:08 +08:00
Seefs	c074ed2eb5	Revert "OAI生图接口支持gemini 3 pro image preview"	2025-11-30 18:49:18 +08:00
Calcium-Ion	d33e4c8d35	Merge pull request #2338 from QuantumNous/revert-2321-pr/gemini-image-edit Revert "Gemini Image系列支持图像编辑"	2025-11-30 18:48:01 +08:00
Seefs	b827d1f778	Revert "Gemini Image系列支持图像编辑"	2025-11-30 18:45:54 +08:00
Seefs	2fb1fa08d2	Revert "fix: gemini image correct generationConfig"	2025-11-30 18:45:23 +08:00
feitianbubu	fb55d56089	feat: gemini image support edit	2025-11-27 16:04:04 +08:00
feitianbubu	156229ce92	fix: gemini image correct generationConfig	2025-11-26 15:54:11 +08:00
feitianbubu	2d6f8fb58f	feat: gemini-3-pro-image-preview add extra param	2025-11-26 12:03:24 +08:00
Claude	7645300f96	feat: enable thoughtSignature for non-function-call messages Previously thoughtSignature was only attached to messages with function calls. This change extends the feature to also attach thoughtSignature to the first text part of assistant/model messages when no tool_calls are present, ensuring compatibility with Gemini thinking models in regular conversation scenarios.	2025-11-24 00:31:20 +00:00
CaIon	c8bbf7bed8	feat: Add ContextKeyLocalCountTokens and update ResponseText2Usage to use context in multiple channels	2025-11-21 18:17:01 +08:00
Seefs	4ac4849ab6	feat: Fill thoughtSignature only for Gemini/Vertex channels using the OpenAI format	2025-11-20 15:54:33 +08:00
CaIon	7391b15b4b	fix: remove redundant error handling for empty Gemini API response	2025-10-13 21:58:50 +08:00
CaIon	063926dd29	fix: update error messages for unsupported parameter names in Google extra body	2025-10-12 22:21:45 +08:00
Seefs	5010f2d004	format: package name -> github.com/QuantumNous/new-api (#2017 )	2025-10-11 15:30:09 +08:00

1 2 3 4

168 Commits