new-api

Author	SHA1	Message	Date
Calcium-Ion	d604f48c06	Merge pull request #4469 from seefs001/fix/tool-arguments-object fix: support raw JSON response tool arguments	2026-04-26 20:20:03 +08:00
Seefs	db89b57e1c	fix: support raw JSON response tool arguments	2026-04-26 13:47:37 +08:00
Seefs	62d4b63fc3	feat: configure native messages model matching	2026-04-26 13:37:59 +08:00
HynoR	435d7ae0dd	feat: support DeepSeek V4 reasoning suffix handling	2026-04-24 16:50:35 +08:00
CaIon	eab478bdc8	fix: miscellaneous quick fixes from CodeRabbit review - log_info_generate.go: add nil guard in InjectTieredBillingInfo - billing_expr_request.go: merge headers instead of replacing - go.mod: remove incorrect // indirect on expr-lang/expr - ToolPriceSettings.jsx: add null check in syncToVisual - tool_billing.go: fix PricePer1K for image_generation (per-call, not per-1K) - utils.jsx: add minute() to time condition regex - useUsageLogsData.jsx: pass displayMode to renderTieredModelPrice - AGENTS.md, CLAUDE.md: fix Rule 6/7 ordering - relay-gemini.go: add TEXT modality case in CandidatesTokensDetails	2026-04-24 00:34:06 +08:00
CaIon	6bde1a9c8d	Merge origin/main into nightly Resolve conflicts: - .gitignore: keep nightly additions (.test, skills-lock.json) - relay/helper/price.go: keep both billingexpr and model imports - en.json / zh-CN.json: keep nightly's superset of i18n entries - service/billing_session.go: add missing 3rd arg to DecreaseUserQuota - en.json / zh-CN.json: deduplicate 129+320 duplicate i18n keys	2026-04-23 21:37:03 +08:00
papersnake	47d7bca268	feat: support claude-opus-4-7 (#4293 ) * feat: support claude-opus-4-7 * feat: summarized display for opus 4.7	2026-04-17 13:52:34 +08:00
CaIon	3cad6b9d7f	fix(claude): improve handling of empty string content in OpenAI to Claude message conversion Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details Build Electron App / build (windows-latest) (push) Has been cancelled Details Build Electron App / release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details	2026-04-16 17:44:38 +08:00
woan1136	3ab65a8221	fix: add Azure channel support for /v1/responses/compact URL routing (#4149 ) Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details Build Electron App / build (windows-latest) (push) Has been cancelled Details Build Electron App / release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details The Azure channel's GetRequestURL method only handled RelayModeResponses but missed RelayModeResponsesCompact. This caused compact requests to fall through to the generic deployments URL pattern, producing an incorrect path that Azure returns 404 for. This fix extends the existing responses API special handling to also cover the compact mode, appending /compact to the subUrl when the relay mode is ResponsesCompact. Affected URLs (before → after): - Normal Azure: /openai/deployments/{model}/responses/compact → /openai/v1/responses/compact - cognitiveservices: same pattern → /openai/responses/compact - Custom AzureResponsesVersion: properly respected for compact too Co-authored-by: 彭俊杰 <pengjunjie@onero.com>	2026-04-13 15:23:38 +08:00
CaIon	8b22161527	fix: set TopP to nil in Claude request configuration	2026-04-13 14:36:22 +08:00
CaIon	4d2993e4cc	Merge remote-tracking branch 'origin/main' into nightly Some checks failed Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details # Conflicts: # web/src/helpers/render.jsx # web/src/hooks/usage-logs/useUsageLogsData.jsx # web/src/i18n/locales/en.json	2026-04-09 17:12:21 +08:00
Calcium-Ion	b07f0b9626	Merge pull request #4154 from seefs001/feature/vllm-extensions-params feat: fill in some custom fields for vllm-omini	2026-04-09 14:35:05 +08:00
Calcium-Ion	53cf37a469	fix(ali): accept string usage values in task polling (#4155 )	2026-04-09 14:34:44 +08:00
NyaMisty	160cb28572	fix(zhipu_4v): use correct endpoint for coding plan image generation (#4146 )	2026-04-09 14:33:48 +08:00
Seefs	274307b0a9	fix(ali): accept string usage values in task polling	2026-04-09 12:48:17 +08:00
Seefs	a19a63b98c	feat: fill in some custom fields for vllm-omini.	2026-04-09 12:41:51 +08:00
forsakenyang	c734db34e8	feat: add minimax image generation relay support (#4103 )	2026-04-08 16:57:44 +08:00
zuiho	c66636a0c7	fix: 采纳 CodeRabbit 建议，!Done 时也用 fallback 覆盖占位 CompletionTokens message_start 阶段可能给 CompletionTokens 非零占位值，只检查 == 0 不够，加上 !Done && fallback > current 条件。	2026-04-07 17:52:11 +08:00
zuiho	f7cdc727df	fix: Claude 流式断流时不再整份覆盖 usage，保留 cache 计费字段 HandleStreamFinalResponse 在 !Done 时调用 ResponseText2Usage 整份覆盖 claudeInfo.Usage，导致 message_start 已获取的 CacheReadInputTokens、 CacheCreationInputTokens 等字段丢失，prompt 退化为占位值 1。修复： - 只补缺失的 CompletionTokens/PromptTokens，保留已有 cache 数据 - PromptTokens 兜底改用 info.GetEstimatePromptTokens()（与其他渠道对齐） Fixes #4127	2026-04-07 17:41:08 +08:00
CaIon	03758a4a85	refactor(file-source): unify file source creation and enhance caching mechanisms	2026-04-06 15:54:55 +08:00
CaIon	8fc0eb78e2	feat(billing): enhance task billing process with video input detection and updated pricing logic - Added `EstimateBilling` function to check for video input in request metadata and return corresponding discount ratios. - Updated `ModelPriceHelperPerCall` to incorporate new pricing logic based on model ratios and video input. - Enhanced task billing logs to include model ratio information and adjusted calculations for actual quota based on additional multipliers. - Introduced `renderTaskBillingProcess` to improve rendering of task billing information in the UI.	2026-04-06 15:54:55 +08:00
Seefs	82c2008d2c	fix: emit claude message_delta for usage-only final stream chunk	2026-04-04 20:21:13 +08:00
CaIon	bb5b9eaca2	fix(relay-claude): set TopP to nil in Claude request to align with API requirements	2026-04-03 20:18:28 +08:00
Calcium-Ion	9816ad87e3	feat: add HEIC/HEIF image format support for Gemini channel (#4049 ) * feat: add HEIC/HEIF image format support Add detection, MIME type mapping, and dimension parsing for HEIC/HEIF images via ISOBMFF ftyp brand inspection and ispe box parsing. Update Gemini relay to accept these formats and refactor getImageConfig to properly retry decoders using buffered data. * fix: handle ISOBMFF extended sizes in HEIF dimension parser parseHEIFDimensions now correctly handles boxSize==1 (64-bit extended size) and boxSize==0 (box-to-EOF), preventing the parser from breaking out of the loop when encountering these valid ISOBMFF box headers before reaching the meta box.	2026-04-02 21:32:42 +08:00
Calcium-Ion	0193018af6	Merge pull request #4042 from feitianbubu/pr/fe9713dcbf8795e127fbea2fcb1f3011da86ad54 新增seedance2.0视频接口支持	2026-04-02 21:30:31 +08:00
RedwindA	79527c0ab1	feat: add HEIC/HEIF image format support Add detection, MIME type mapping, and dimension parsing for HEIC/HEIF images via ISOBMFF ftyp brand inspection and ispe box parsing. Update Gemini relay to accept these formats and refactor getImageConfig to properly retry decoders using buffered data.	2026-04-02 16:40:45 +08:00
Calcium-Ion	41cd051ea9	Merge pull request #3505 from seefs001/fix/claude-media-support fix: add basic inline file support for Claude relay	2026-04-02 13:29:21 +08:00
Seefs	c04f82bfb5	TODO: fix chat -> messages file type	2026-04-02 13:16:58 +08:00
feitianbubu	dafc7618c3	feat: add seedance fail reason	2026-04-02 12:26:44 +08:00
feitianbubu	22692b3f87	feat: seedance support seconds	2026-04-02 12:26:37 +08:00
feitianbubu	d36e892905	fix: seedance only one text	2026-04-02 12:26:33 +08:00
feitianbubu	3cd1ba4673	fix: seedance metadata override prompt	2026-04-02 12:26:27 +08:00
feitianbubu	b7c0f754ad	feat: add seedance2.0 video api	2026-04-02 12:24:24 +08:00
CaIon	35d0704640	Merge branch 'origin/main' into nightly Resolve 4 conflicts: - relay/compatible_handler.go: accept main's refactor (postConsumeQuota -> service.PostTextConsumeQuota) - service/quota.go: accept main's PostClaudeConsumeQuota deletion, keep nightly's tiered billing in PostWssConsumeQuota and PostAudioConsumeQuota - web/src/i18n/locales/{en,zh-CN}.json: merge both sets of translation keys Post-merge integration: - Add tiered billing (TryTieredSettle, InjectTieredBillingInfo) to PostTextConsumeQuota - Update tool pricing calls to use nightly's generic GetToolPriceForModel/GetToolPrice API	2026-04-02 00:39:13 +08:00
Calcium-Ion	7efb1922fe	Merge pull request #3526 from feitianbubu/pr/e560265b6e57aa7b95bc98cb53397ef0a3082d9d 支持wan2.7生图-wan2.7-image	2026-04-02 00:15:04 +08:00
Calcium-Ion	89fe99f3bd	Merge pull request #3512 from imlhb/patch-2 fix: prevent double-counting of image count n in billing	2026-04-02 00:14:39 +08:00
feitianbubu	e5b5331d3b	feat: wan 2.7 support N for gen images number	2026-04-01 17:39:50 +08:00
feitianbubu	18373c6eac	feat: add wan 2.7	2026-04-01 17:39:11 +08:00
CaIon	ab99c30884	fix: move image count n to OtherRatio to prevent double-counting The previous commit commented out AddOtherRatio("n") in the Ali adaptor to fix double-counting but this could cause billing evasion when n is specified via extra["parameters"] instead of request.N. Root cause: ImagePriceRatio in GetTokenCountMeta() already included n, AND channel adaptors added OtherRatio("n"), resulting in n² billing. Proper fix: - Remove n from ImagePriceRatio (keep sizeRatio * qualityRatio only) - In ImageHelper, add default OtherRatio("n") when adaptor hasn't set one; set fallback tokens to 1 (base unit) - Restore Ali adaptor's AddOtherRatio("n") — it uses actual upstream parameters/response count, preventing billing evasion	2026-03-31 23:58:10 +08:00
CaIon	d22f889e5d	fix(xAI): set MaxTokens to nil when MaxCompletionTokens is 0 for grok-3-mini model	2026-03-31 19:16:16 +08:00
刘泓宾	53aeee4ff7	Comment out price data adjustment logic Comment out code that modifies price data based on image count. 测试发现，如果是接入阿里百炼平台的qwen-image-2.0系列模型，这边计费的时候会出现 0.2n倍率n的情况，最前面的0.2n会直接显示为模型价格。例如：日志详情模型价格 $0.600000，专属倍率 0.3 其他详情大小 10801920, 品质 standard, 生成数量 3, 其他倍率 n: 3.000000 计费过程模型价格：$0.600000 专属倍率：0.3 = $0.180000	2026-03-31 17:12:06 +08:00
CaIon	5238f279db	feat: record stream interruption reasons via StreamStatus Some checks failed Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (amd64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Build & push (arm64) [native] (push) Has been cancelled Details Publish Docker image (Multi Registries, native amd64+arm64) / Create multi-arch manifests (Docker Hub) (push) Has been cancelled Details Build Electron App / build (windows-latest) (push) Has been cancelled Details Build Electron App / release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Linux Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / macOS Release (push) Has been cancelled Details Release (Linux, macOS, Windows) / Windows Release (push) Has been cancelled Details - Add StreamStatus type (relay/common) to track stream end reason (done/timeout/client_gone/scanner_error/eof/panic/ping_fail) and accumulate soft errors during streaming via sync.Once + sync.Mutex. - Add StreamResult (relay/helper) as the callback interface: adapters call sr.Error() for soft errors, sr.Stop() for fatal, sr.Done() for normal completion. No early-return problem — multiple errors per chunk are naturally supported. - Refactor StreamScannerHandler callback from func(string) bool to func(string, *StreamResult). All 9 channel adapters updated. - Write stream_status into log other JSON field (admin-only) with status ok/error, end_reason, error_count, and error messages. - Frontend: display stream status in log detail expansion for admins.	2026-03-31 16:54:39 +08:00
Seefs	263b9bc695	fix: add basic inline file support for Claude relay	2026-03-30 19:37:21 +08:00
feitianbubu	62b9aaa520	feat: prevent metadata from overriding model fields	2026-03-27 15:31:41 +08:00
Calcium-Ion	0191a68d4e	Merge pull request #3400 from seefs001/fix/openai-usage refactor: optimize billing flow for OpenAI-to-Anthropic convert	2026-03-23 15:03:57 +08:00
Calcium-Ion	763c3ff709	Merge pull request #3331 from seefs001/fix/claude-beta-query fix: apply forced beta query at final upstream URL stage	2026-03-23 15:03:36 +08:00
Seefs	9ecad90652	refactor: optimize billing flow for OpenAI-to-Anthropic convert	2026-03-23 14:22:12 +08:00
wenyifan	2c3ae32c8e	fix map	2026-03-20 16:48:04 +08:00
wenyifan	498199b37d	fix code quality	2026-03-20 16:38:48 +08:00
wenyifan	ff29900f30	feat: Add support for counting cache-hit tokens in llama.cpp OpenAI-Compatible API	2026-03-20 16:10:18 +08:00

1 2 3 4 5 ...

1135 Commits