new-api

Author	SHA1	Message	Date
CaIon	5238f279db	feat: record stream interruption reasons via StreamStatus - Add StreamStatus type (relay/common) to track stream end reason (done/timeout/client_gone/scanner_error/eof/panic/ping_fail) and accumulate soft errors during streaming via sync.Once + sync.Mutex. - Add StreamResult (relay/helper) as the callback interface: adapters call sr.Error() for soft errors, sr.Stop() for fatal, sr.Done() for normal completion. No early-return problem: multiple errors per chunk are naturally supported. - Refactor StreamScannerHandler callback from func(string) bool to func(string, *StreamResult). All 9 channel adapters updated. - Write stream_status into log other JSON field (admin-only) with status ok/error, end_reason, error_count, and error messages. - Frontend: display stream status in log detail expansion for admins.	2026-03-31 16:54:39 +08:00
CaIon	1fededceb3	feat: refactor token estimation logic - Introduced new OpenAI text models in `common/model.go`. - Added `IsOpenAITextModel` function to check for OpenAI text models. - Refactored token estimation methods across various channels to use estimated prompt tokens instead of direct prompt token counts. - Updated related functions and structures to accommodate the new token estimation approach, enhancing overall token management.	2025-12-02 21:34:39 +08:00
Seefs	5010f2d004	format: package name -> github.com/QuantumNous/new-api (#2017)	2025-10-11 15:30:09 +08:00
RedwindA	78f34bd095	fix(openai): add nil checks for web_search streaming to prevent panic	2025-10-01 22:19:22 +08:00
creamlike1024	f6984272bf	fix: openai responses api did not count image generation calls for billing	2025-09-16 12:47:59 +08:00
HynoR	23a82b9646	fix: enhance tool usage parsing with additional nil checks and error logging	2025-09-07 07:42:25 +08:00
HynoR	ab5351c270	fix: add error handling for missing built-in tools and validate response in stream handler	2025-09-05 13:58:24 +08:00
CaIon	906f797be5	refactor: replace json.Marshal with common.Marshal for consistency and error handling	2025-08-28 13:51:07 +08:00
CaIon	b0f86bd82e	fix: update PromptTokens assignment logic in relay_responses	2025-08-26 14:21:10 +08:00
CaIon	3e5bc637de	fix: Invalid type for 'input[x].summary': expected an array of objects, but got null instead	2025-08-26 13:17:31 +08:00
CaIon	4e3f008ae9	fix: update token usage calculation	2025-08-16 19:11:15 +08:00
CaIon	8284cff9b7	refactor: set prompt tokens when not provided in usage	2025-08-15 15:55:01 +08:00
CaIon	5084a43aa9	refactor: ensure graceful closure of response body in relay responses	2025-08-15 15:10:54 +08:00
CaIon	97ea8b6560	refactor: Introduce pre-consume quota and unify relay handlers This commit introduces a major architectural refactoring to improve quota management, centralize logging, and streamline the relay handling logic. Key changes: - Pre-consume Quota: Implements a new mechanism to check and reserve user quota before making the request to the upstream provider. This ensures more accurate quota deduction and prevents users from exceeding their limits due to concurrent requests. - Unified Relay Handlers: Refactors the relay logic to use generic handlers (e.g., `ChatHandler`, `ImageHandler`) instead of provider-specific implementations. This significantly reduces code duplication and simplifies adding new channels. - Centralized Logger: A new dedicated `logger` package is introduced, and all system logging calls are migrated to use it, moving this responsibility out of the `common` package. - Code Reorganization: DTOs are generalized (e.g., `dalle.go` -> `openai_image.go`) and utility code is moved to more appropriate packages (e.g., `common/http.go` -> `service/http.go`) for better code structure.	2025-08-14 20:05:06 +08:00
CaIon	c4666934be	Revert "feat: update Usage struct to support dynamic token handling with ceil function #1503" This reverts commit 97b8d7de9ee2f94480c74ed6c51c16552f13ea99.	2025-08-07 16:22:40 +08:00
CaIon	97b8d7de9e	feat: update Usage struct to support dynamic token handling with ceil function #1503	2025-08-07 15:40:12 +08:00
creamlike1024	02fccf0330	fix: responses stream cache tokens were not billed	2025-08-05 23:08:08 +08:00
creamlike1024	2d226a813e	fix: responses cache tokens were not billed	2025-08-05 22:56:27 +08:00
CaIon	563d056ff7	refactor: update error handling to support dynamic error types	2025-07-31 21:16:01 +08:00
CaIon	2db71673a5	fix: auto ban	2025-07-30 18:39:19 +08:00
Xyfacai	dfc826a388	fix(response): tools has few parameters that need handling, so use a map	2025-07-20 11:21:57 +08:00
CaIon	4a9ca4209a	refactor: Introduce standardized API error This commit refactors the application's error handling mechanism by introducing a new standardized error type, `types.NewAPIError`. It also renames common JSON utility functions for better clarity. Previously, internal error handling was tightly coupled to the `dto.OpenAIError` format. This change decouples the internal logic from the external API representation. Key changes: - A new `types.NewAPIError` struct is introduced to serve as a canonical internal representation for all API errors. - All relay adapters (OpenAI, Claude, Gemini, etc.) are updated to return `*types.NewAPIError`. - Controllers now convert the internal `NewAPIError` to the client-facing `OpenAIError` format at the API boundary, ensuring backward compatibility. - Channel auto-disable/enable logic is updated to use the new standardized error type. - JSON utility functions are renamed to align with Go's standard library conventions (e.g., `UnmarshalJson` -> `Unmarshal`, `EncodeJson` -> `Marshal`).	2025-07-10 15:02:40 +08:00
CaIon	e2c34bdeb9	🐛 fix: refactor JSON unmarshalling across multiple handlers to use UnmarshalJson and UnmarshalJsonStr for consistency This update replaces instances of DecodeJson and DecodeJsonStr with UnmarshalJson and UnmarshalJsonStr in various relay handlers, enhancing code consistency and clarity in JSON processing. The changes improve maintainability and align with recent refactoring efforts in the codebase.	2025-06-28 00:02:07 +08:00
CaIon	aae9e81c20	🐛 fix: refactor response body handling in multiple relay handlers to utilize IOCopyBytesGracefully	2025-06-27 23:35:56 +08:00
CaIon	ae586e1be9	✨ feat: add CloseResponseBodyGracefully function to handle HTTP response body closure	2025-06-27 21:37:13 +08:00
CaIon	16997a695d	refactor: token counter logic	2025-06-21 00:54:40 +08:00
creamlike1024	a3fe88772f	feat: add built-in tools billing - Add tool call count statistics for non-streaming responses - Add web search and file search billing	2025-05-06 21:58:01 +08:00
CaIon	e9812b0fd5	feat: implement OpenAI responses handling and streaming support with built-in tool tracking	2025-05-05 00:40:16 +08:00

28 Commits