mirror of
https://github.com/labring/FastGPT.git
synced 2026-04-26 02:07:28 +08:00
87b0bca30c
* cloud doc * doc refactor * doc move * seo * remove doc * yml * doc * fix: tsconfig * fix: tsconfig
138 lines
3.7 KiB
Plaintext
138 lines
3.7 KiB
Plaintext
---
|
|
title: Integrating bge-rerank Reranking Model
|
|
description: Integrating bge-rerank reranking model with FastGPT
|
|
---
|
|
|
|
## Recommended Configuration by Model
|
|
|
|
| Model Name | RAM | VRAM | Disk Space | Start Command |
|
|
|------|---------|---------|----------|--------------------------|
|
|
| bge-reranker-base | >=4GB | >=4GB | >=8GB | python app.py |
|
|
| bge-reranker-large | >=8GB | >=8GB | >=8GB | python app.py |
|
|
| bge-reranker-v2-m3 | >=8GB | >=8GB | >=8GB | python app.py |
|
|
|
|
|
|
## Source Code Deployment
|
|
|
|
### 1. Environment Setup
|
|
|
|
- Python 3.9 or 3.10
|
|
- CUDA 11.7
|
|
- Network access to download models
|
|
|
|
### 2. Download Code
|
|
|
|
Code repositories for the 3 models:
|
|
|
|
1. [https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-base](https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-base)
|
|
2. [https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-large](https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-large)
|
|
3. [https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-v2-m3](https://github.com/labring/FastGPT/tree/main/plugins/model/rerank-bge/bge-reranker-v2-m3)
|
|
|
|
### 3. Install Dependencies
|
|
|
|
```sh
|
|
pip install -r requirements.txt
|
|
```
|
|
|
|
### 4. Download Models
|
|
|
|
HuggingFace repositories for the 3 models:
|
|
|
|
1. [https://huggingface.co/BAAI/bge-reranker-base](https://huggingface.co/BAAI/bge-reranker-base)
|
|
2. [https://huggingface.co/BAAI/bge-reranker-large](https://huggingface.co/BAAI/bge-reranker-large)
|
|
3. [https://huggingface.co/BAAI/bge-reranker-v2-m3](https://huggingface.co/BAAI/bge-reranker-v2-m3)
|
|
|
|
Clone the model into the corresponding code directory. Directory structure:
|
|
|
|
```
|
|
bge-reranker-base/
|
|
app.py
|
|
Dockerfile
|
|
requirements.txt
|
|
```
|
|
|
|
### 5. Run
|
|
|
|
```bash
|
|
python app.py
|
|
```
|
|
|
|
On successful startup, you should see an address like this:
|
|
|
|

|
|
|
|
> `http://0.0.0.0:6006` is the connection address.
|
|
|
|
## Docker Deployment
|
|
|
|
**Image names:**
|
|
|
|
1. registry.cn-hangzhou.aliyuncs.com/fastgpt/bge-rerank-base:v0.1 (4 GB+)
|
|
2. registry.cn-hangzhou.aliyuncs.com/fastgpt/bge-rerank-large:v0.1 (5 GB+)
|
|
3. registry.cn-hangzhou.aliyuncs.com/fastgpt/bge-rerank-v2-m3:v0.1 (5 GB+)
|
|
|
|
**Port**
|
|
|
|
6006
|
|
|
|
**Environment Variables**
|
|
|
|
```
|
|
ACCESS_TOKEN=your_access_token (used in request header: Authorization: Bearer ${ACCESS_TOKEN})
|
|
```
|
|
|
|
**Run Command Example**
|
|
|
|
```sh
|
|
# auth token set to mytoken
|
|
docker run -d --name reranker -p 6006:6006 -e ACCESS_TOKEN=mytoken --gpus all registry.cn-hangzhou.aliyuncs.com/fastgpt/bge-rerank-base:v0.1
|
|
```
|
|
|
|
**docker-compose.yml Example**
|
|
```
|
|
version: "3"
|
|
services:
|
|
reranker:
|
|
image: registry.cn-hangzhou.aliyuncs.com/fastgpt/bge-rerank-base:v0.1
|
|
container_name: reranker
|
|
# GPU runtime. If the host doesn't have GPU drivers installed, comment out the deploy section.
|
|
deploy:
|
|
resources:
|
|
reservations:
|
|
devices:
|
|
- driver: nvidia
|
|
count: all
|
|
capabilities: [gpu]
|
|
ports:
|
|
- 6006:6006
|
|
environment:
|
|
- ACCESS_TOKEN=mytoken
|
|
|
|
```
|
|
## Integrate with FastGPT
|
|
|
|
1. Open the FastGPT model configuration and add a new reranking model.
|
|
2. Fill in the model configuration form: set the Model ID to `bge-reranker-base` and the address to `{{host}}/v1/rerank`, where host is your deployed domain or IP:Port.
|
|
|
|

|
|
|
|
## FAQ
|
|
|
|
### 403 Error
|
|
|
|
The custom request token in FastGPT does not match the ACCESS_TOKEN environment variable.
|
|
|
|
### Docker reports `Bus error (core dumped)`
|
|
|
|
Try adding the `shm_size` option to your `docker-compose.yml` to increase the shared memory size in the container.
|
|
|
|
```
|
|
...
|
|
services:
|
|
reranker:
|
|
...
|
|
container_name: reranker
|
|
shm_size: '2gb'
|
|
...
|
|
```
|