This repository contains the SFT+GRPO finetuning implementation for openPangu with the FMBench dataset.
Step 1: Install the environment
conda env create -f environment_pangu_grpo_root.ymlStep 2: Replace the installed transformers package with our custom source code.
Step1: Download the longformer weight checkpoint archive at Modelscopes and extract it to the project directory.
Step2: Download the pangu weight checkpoint archive at Modelscopes and extract it to the project directory.
Note: The scripts in this repository use the 1B model as an example. To run the 7B model, please update the command-line arguments accordingly.
bash run_sft.sh
bash run_sft_grpo_8npu.sh
bash run_grpo.sh
bash run_inf.sh
bash run_eval.sh