FMBench: Adaptive Large Language Model Output Formatting

This repository contains the SFT+GRPO finetuning implementation for openPangu with the FMBench dataset.

⚙️ Installation

Step 1: Install the environment

conda env create -f environment_pangu_grpo_root.yml

Step 2: Replace the installed transformers package with our custom source code.

🤗 Setup

Step1: Download the longformer weight checkpoint archive at Modelscopes and extract it to the project directory.

Step2: Download the pangu weight checkpoint archive at Modelscopes and extract it to the project directory.

📌 Getting Started

Note: The scripts in this repository use the 1B model as an example. To run the 7B model, please update the command-line arguments accordingly.

Supervised Fine-tuning

bash run_sft.sh

SFT+GRPO Fine-tuning

bash run_sft_grpo_8npu.sh

GRPO Fine-tuning

bash run_grpo.sh

Run Inference & Evaluation

bash run_inf.sh
bash run_eval.sh

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
accelerate_configs		accelerate_configs
config		config
data_json		data_json
framework		framework
kernel_meta		kernel_meta
transformers		transformers
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
accelerate_quick_start.md		accelerate_quick_start.md
environment_dev3.yml		environment_dev3.yml
environment_pangu_grpo_backup_do_not_use.yml		environment_pangu_grpo_backup_do_not_use.yml
environment_pangu_grpo_root.yml		environment_pangu_grpo_root.yml
eval_log.txt		eval_log.txt
evaluate_outputs.py		evaluate_outputs.py
fmbdata.py		fmbdata.py
fusion_result.json		fusion_result.json
generate_variants.py		generate_variants.py
helper.py		helper.py
run_eval.sh		run_eval.sh
run_eval_1b.py		run_eval_1b.py
run_eval_7b.py		run_eval_7b.py
run_grpo_8npu.sh		run_grpo_8npu.sh
run_inf.py		run_inf.py
run_inf.sh		run_inf.sh
run_sft.sh		run_sft.sh
run_sft_grpo_8npu.sh		run_sft_grpo_8npu.sh
train_grpo_batch.py		train_grpo_batch.py
train_sft.py		train_sft.py
train_sft_grpo_batch.py		train_sft_grpo_batch.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FMBench: Adaptive Large Language Model Output Formatting

⚙️ Installation

🤗 Setup

📌 Getting Started

Supervised Fine-tuning

SFT+GRPO Fine-tuning

GRPO Fine-tuning

Run Inference & Evaluation

About

Uh oh!

Releases

Packages

Languages

License

FudanCVL/FMBench

Folders and files

Latest commit

History

Repository files navigation

FMBench: Adaptive Large Language Model Output Formatting

⚙️ Installation

🤗 Setup

📌 Getting Started

Supervised Fine-tuning

SFT+GRPO Fine-tuning

GRPO Fine-tuning

Run Inference & Evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages