mirror of https://github.com/labring/FastGPT.git synced 2025-10-13 22:56:28 +00:00

Go to file

chanzhi82020 31c17999b8 This PR introduces evaluation support designed specifically to track and benchmark applications built on the FastGPT platform. (#5476 )

- Adds a lightweight evaluation framework for app-level tracking and benchmarking.
- Changes: 28 files, +1455 additions, -66 deletions.
- Branch: add-evaluations -> main.
- PR: https://github.com/chanzhi82020/FastGPT/pull/1

Applications built on FastGPT need repeatable, comparable benchmarks to measure regressions, track improvements, and validate releases. This initial implementation provides the primitives to define evaluation scenarios, run them against app endpoints or model components, and persist results for later analysis.

I updated the PR description to emphasize that the evaluation system is targeted at FastGPT-built apps and expanded the explanation of the core pieces so reviewers understand the scope and intended use. The new description outlines the feature intent, core components, and how results are captured and aggregated for benchmarking.

- Evaluation definitions
- Define evaluation tasks that reference an app (app id, version, endpoint), test datasets or input cases, expected outputs (when applicable), and run configuration (parallelism, timeouts).
- Support for custom metric plugins so teams can add domain-specific measures.

- Runner / Executor
- Executes evaluation cases against app endpoints or internal model interfaces.
- Captures raw responses, response times, status codes, and any runtime errors.
- Computes per-case metrics (e.g., correctness, latency) immediately after each case run.

- Metrics & Aggregation
- Built-in metrics: accuracy/success rate, latency (p50/p90/p99), throughput, error rate.
- Aggregation produces per-run summaries and per-app historical summaries for trend analysis.
- Allows combining metrics into composite scores for high-level benchmarking.

- Persistence & Logging
- Stores run results, input/output pairs (when needed), timestamps, environment info, and app/version metadata so runs are reproducible and auditable.
- Logs are retained to facilitate debugging and root-cause analysis of regressions.

- Reporting & Comparison
- Produces aggregated reports suitable for CI gating, release notes, or dashboards.
- Supports comparing multiple app versions or deployments side-by-side.

- Extensibility & Integration
- Designed to plug into CI (automated runs on PRs or releases), dashboards, and downstream analysis tools.
- Easy to add new metrics, evaluators, or dataset connectors.

By centering the evaluation system on FastGPT apps, teams can benchmark full application behavior (not only raw model outputs), correlate metrics with deployment configurations, and make informed release decisions.

- Expand built-in metric suite (e.g., F1, BLEU/ROUGE where applicable), add dataset connectors, and provide example evaluation scenarios for sample apps.
- Integrate with CI pipelines and add basic dashboarding for trend visualization.

Related Issue: N/A

Co-authored-by: Archer <545436317@qq.com>

2025-09-16 15:20:59 +08:00

.claude

feature: V4.12.2 (#5525 )

2025-08-25 19:19:43 +08:00

.github

Update docs-deploy.yml (#5594 )

2025-09-04 23:03:47 +08:00

.husky

feature: V4.12.2 (#5525 )

2025-08-25 19:19:43 +08:00

.vscode

feature: V4.12.2 (#5525 )

2025-08-25 19:19:43 +08:00

bin

fix: the helm release failed due to version handle (#2199 )

2024-07-31 10:19:42 +08:00

deploy

perf: init shell (#5651 )

2025-09-15 22:21:24 +08:00

document

perf: init shell (#5651 )

2025-09-15 22:21:24 +08:00

packages

This PR introduces evaluation support designed specifically to track and benchmark applications built on the FastGPT platform. (#5476 )

2025-09-16 15:20:59 +08:00

plugins

V4.12.4 features (#5626 )

2025-09-15 20:02:54 +08:00

projects

This PR introduces evaluation support designed specifically to track and benchmark applications built on the FastGPT platform. (#5476 )

2025-09-16 15:20:59 +08:00

scripts

feature: V4.12.2 (#5525 )

2025-08-25 19:19:43 +08:00

test

V4.12.4 features (#5626 )

2025-09-15 20:02:54 +08:00

.dockerignore

4.6.5- CoreferenceResolution Module (#631 )

2023-12-22 10:47:31 +08:00

.eslintignore

remove old doc (#5305 )

2025-07-24 10:39:41 +08:00

.eslintrc.json

feat: update ESLint config with @typescript-eslint/consistent-type-imports (#4746 )

2025-05-06 17:33:09 +08:00

.gitignore

perf: rrf code (#5558 )

2025-08-29 01:24:19 +08:00

.imgbotconfig

docs: update font and cdn (#696 )

2024-01-05 18:02:53 +08:00

.npmrc

Plugin runtime (#2050 )

2024-07-15 22:50:48 +08:00

.prettierignore

4.11.2 dev (#5368 )

2025-08-02 19:38:37 +08:00

.prettierrc.js

README

2023-03-28 00:48:24 +08:00

CLAUDE.md

feature: V4.11.1 (#5350 )

2025-08-01 16:08:20 +08:00

dev.md

Feat: admin audit (#5068 )

2025-06-19 10:35:21 +08:00

env.d.ts

V4.11.0 features (#5270 )

2025-07-22 09:42:50 +08:00

LICENSE

4.11.2 dev (#5368 )

2025-08-02 19:38:37 +08:00

Makefile

V4.9.6 feature (#4565 )

2025-04-16 22:18:51 +08:00

package.json

V4.12.4 features (#5626 )

2025-09-15 20:02:54 +08:00

pnpm-lock.yaml

V4.12.4 features (#5626 )

2025-09-15 20:02:54 +08:00

pnpm-workspace.yaml

fix: downgrade md lib (#4508 )

2025-04-11 13:31:30 +08:00

README_en.md

4.11.2 dev (#5368 )

2025-08-02 19:38:37 +08:00

README_ja.md

4.11.2 dev (#5368 )

2025-08-02 19:38:37 +08:00

README.md

feat: Store pdfparse in local (#5534 )

2025-08-26 14:35:39 +08:00

SECURITY.md

perf: memory leak (#5370 )

2025-08-03 22:37:45 +08:00

tsconfig.json

Fix some bug (#5048 )

2025-06-17 16:10:01 +08:00

vitest.config.mts

V4.12.4 features (#5626 )

2025-09-15 20:02:54 +08:00

zhlint

4.8.10 perf (#2633 )

2024-09-06 17:22:24 +08:00

README_en.md

FastGPT

English | 简体中文 | 日语

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

🎥 Comprehensive Feature Demonstration

https://github.com/labring/FastGPT/assets/15308462/7d3a38df-eb0e-4388-9250-2409bd33f6d4

🛸 Online Use

Website: fastgpt.io


Conversational AI Setup	Workflow Automation

Knowledge Base Setup	Integration Process

💡 Features

Features	Details
Application Orchestration Features	✅ Offers a straightforward mode, eliminating the need for complex orchestration ✅ Provides clear next-step instructions in dialogues ✅ Facilitates workflow orchestration ✅ Tracks references in source files ✅ Encapsulates modules for enhanced reuse at multiple levels ✅ Combines search and reordering functions 🔜 Includes a tool module 🔜 Integrates Laf for online HTTP module creation 🔜 Plugin encapsulation capabilities
Knowledge Base Features	✅ Allows for the mixed use of multiple databases ✅ Keeps track of modifications and deletions in data chunks ✅ Enables specific vector models for each knowledge base ✅ Stores original source files ✅ Supports direct input and segment-based QA import ✅ Compatible with a variety of file formats: pdf, docx, txt, html, md, csv ✅ Facilitates URL reading and bulk CSV importing 🔜 Supports PPT and Excel file import 🔜 Features a file reader 🔜 Offers diverse data preprocessing options
Application Debugging Features	✅ Enables targeted search testing within the knowledge base ✅ Allows feedback, editing, and deletion during conversations ✅ Presents the full context of interactions ✅ Displays all intermediate values within modules 🔜 Advanced Debug mode for orchestration
OpenAPI Interface	✅ The completions interface (aligned with GPT's chat mode interface) ✅ CRUD operations for the knowledge base 🔜 CRUD operations for conversation
Operational Features	✅ Share without requiring login ✅ Easy embedding with Iframe ✅ Customizable chat window embedding with features like default open, drag-and-drop ✅ Centralizes conversation records for review and annotation

👨‍💻 Development

Project tech stack: NextJs + TS + ChakraUI + MongoDB + PostgreSQL (PG Vector plug-in)/Milvus

⚡ Fast Deployment

When using Sealos services, there is no need to purchase servers or domain names. It supports high concurrency and dynamic scaling, and the database application uses the kubeblocks database, which far exceeds the simple Docker container deployment in terms of IO performance.

[![](https://cdn.jsdelivr.net/gh/labring-actions/templates@main/Deploy-on-Sealos.svg)](https://cloud.sealos.io/?openapp=system-fastdeploy%3FtemplateName%3Dfastgpt&uid=fnWRt09fZP)

Give it a 2-4 minute wait after deployment as it sets up the database. Initially, it might be a too slow since we're using the basic settings.

sealos one click deployment tutorial
Getting Started with Local Development
Deploying FastGPT
Guide on System Configs
Configuring Multiple Models
Version Updates & Upgrades

🤝 Third-party Ecosystem

luolinAI: Enterprise WeChat bot, ready to use

🏘️ Community & Support

🌐 Visit the FastGPT website for full documentation and useful links.
💬 Join our Discord server is to chat with FastGPT developers and other FastGPT users. This is a good place to learn about FastGPT, ask questions, and share your experiences.
🐞 Create GitHub Issues for bug reports and feature requests.

👀 Others

🌱 Contributors

We welcome all forms of contributions. If you are interested in contributing code, you can check out our GitHub Issues to show us your ideas.

🌟 Star History

📄 Usage Agreement

This repository complies with the FastGPT Open Source License open source agreement.

Direct commercial use as a backend service is allowed, but provision of SaaS services is not allowed.
Without commercial authorization, any form of commercial service must retain relevant copyright information.
For full details, please see FastGPT Open Source License
Contact: Dennis@sealos.io , click to view commercial version pricing strategy

Languages

JavaScript 52.1%

TypeScript 38.2%

MDX 5.3%

HTML 3.4%

Python 0.7%

README_en.md

FastGPT

🎥 Comprehensive Feature Demonstration

🛸 Online Use

💡 Features

👨‍💻 Development

💪 Related Projects

🤝 Third-party Ecosystem

🏘️ Community & Support

👀 Others

🌱 Contributors

🌟 Star History

📄 Usage Agreement