update doc search engine (#5386)

* update doc search engine * custom tokenizer * tokenizer
2025-10-19 01:54:04 +00:00 · 2025-08-04 22:07:52 +08:00
parent 545d8150f2
commit 6a0b0b1991
25 changed files with 432 additions and 324 deletions
--- a/document/content/docs/introduction/development/configuration.mdx
+++ b/document/content/docs/introduction/development/configuration.mdx
@@ -5,12 +5,14 @@ description: FastGPT 配置参数介绍

 由于环境变量不利于配置复杂的内容，新版 FastGPT 采用了 ConfigMap 的形式挂载配置文件，你可以在 `projects/app/data/config.json` 看到默认的配置文件。可以参考 [docker-compose 快速部署](/docs/development/docker/) 来挂载配置文件。

-**开发环境下**，你需要将示例配置文件 `config.json` 复制成 `config.local.json` 文件才会生效。  
+**开发环境下**，你需要将示例配置文件 `config.json` 复制成 `config.local.json` 文件才会生效。

 下面配置文件示例中包含了系统参数和各个模型配置：

 ## 4.8.20+ 版本新配置文件示例
+
 > 从4.8.20版本开始，模型在页面中进行配置。
+
 ```json
 {
  "feConfigs": {
@@ -22,7 +24,8 @@ description: FastGPT 配置参数介绍
    "vlmMaxProcess": 15, // 图片理解模型最大处理进程
    "tokenWorkers": 50, // Token 计算线程保持数，会持续占用内存，不能设置太大。
    "hnswEfSearch": 100, // 向量搜索参数，仅对 PG 和 OB 生效。越大，搜索越精确，但是速度越慢。设置为100，有99%+精度。
-    "customPdfParse": { // 4.9.0 新增配置
+    "customPdfParse": {
+      // 4.9.0 新增配置
      "url": "", // 自定义 PDF 解析服务地址
      "key": "", // 自定义 PDF 解析服务密钥
      "doc2xKey": "", // doc2x 服务密钥
@@ -57,7 +60,7 @@ description: FastGPT 配置参数介绍

 #### 2. 修改 FastGPT 配置文件

-开源版用户在 `config.json` 文件中添加 `systemEnv.customPdfParse.doc2xKey` 配置，并填写上申请到的 API Key。并重启服务。
+社区版用户在 `config.json` 文件中添加 `systemEnv.customPdfParse.doc2xKey` 配置，并填写上申请到的 API Key。并重启服务。

 商业版用户在 Admin 后台根据表单指引填写 Doc2x 服务密钥。