Code for the paper: *Multi-hop Reasoning via Early Knowledge Alignment*. This repository provides an implementation built on Graph-R1.
- `graphr1/`: retrieval + hypergraph core.
- `agent/`: agent/tool wrappers.
- `verl/`: RL training stack (GRPO / PPO / REINFORCE++).
- `script_build.py`: build the Knowledge HyperGraph and FAISS indexes.
- `script_api.py`: retrieval API server (port 9001).
- `get_knowledge.py`: retrieve top-k early knowledge per question.
- `script_process.py`: build parquet data with early-knowledge prompts.
- `run_grpo.sh`, `run_ppo.sh`, `run_rpp.sh`: training entrypoints.
- `evaluation/`: evaluation helpers.
```shell
conda create -n eka python==3.11.11
conda activate eka
pip install torch==2.4.0 --index-url https://download.pytorch.org/whl/cu124
pip install flash-attn --no-build-isolation
pip install -r requirements.txt
pip install -e .
```

If you run the retrieval server, make sure `faiss`, `FlagEmbedding`, `fastapi`, `uvicorn`, and `datasets` are included in `requirements.txt` for your environment.
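Before starting the retrieval server, a quick sanity check can confirm that its dependencies are importable. This is a minimal sketch; the package names simply follow the note above:

```python
# Check that the packages the retrieval server needs are importable
# in the current environment, without actually importing them.
import importlib.util

required = ["faiss", "FlagEmbedding", "fastapi", "uvicorn", "datasets"]
missing = [name for name in required if importlib.util.find_spec(name) is None]
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All retrieval-server dependencies found.")
```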
We conduct experiments on six datasets: 2WikiMultiHopQA, HotpotQA, MuSiQue, NQ, PopQA, and TriviaQA. Graph-R1 provides a TeraBox download link; place the dataset folders under `datasets/`. For researchers who cannot access TeraBox, we also provide a Baidu link: [https://pan.baidu.com/s/1dlVYbApjagclIlN7FY4mNw?pwd=6yar].
Expected raw layout:

```
datasets/<DATASET>/raw/qa_train.json
datasets/<DATASET>/raw/qa_dev.json
datasets/<DATASET>/raw/qa_test.json
datasets/<DATASET>/corpus.jsonl
```
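A small helper can verify this layout before the build step. This is a minimal sketch; `check_raw_layout` is a hypothetical helper, not part of the repository:

```python
# Verify the expected raw dataset layout; returns the paths that are missing.
from pathlib import Path

def check_raw_layout(root: str, dataset: str) -> list[str]:
    """Return expected files under root/dataset that do not exist."""
    base = Path(root) / dataset
    expected = [
        base / "raw" / "qa_train.json",
        base / "raw" / "qa_dev.json",
        base / "raw" / "qa_test.json",
        base / "corpus.jsonl",
    ]
    return [str(p) for p in expected if not p.exists()]

print("missing:", check_raw_layout("datasets", "2WikiMultiHopQA"))
```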
`script_build.py` reads `datasets/<DATASET>/corpus.jsonl` directly and writes to `expr/<DATASET>`; no parquet files are needed for this step. Place `openai_api_key.txt` in the repo root.
```shell
python script_build.py --data_source 2WikiMultiHopQA
```

Pre-built Knowledge HyperGraph download: Graph-R1 provides a TeraBox download link; place the dataset folders under `expr/`. For researchers who cannot access TeraBox, we also provide a Baidu link: [https://pan.baidu.com/s/1T-7NvLnI8tUlkZ63I0B5tA?pwd=y38j].
```shell
python script_api.py --data_source 2WikiMultiHopQA
python get_knowledge.py --data_source 2WikiMultiHopQA --top_n 5 --api_url http://localhost:9001
python script_process.py --data_source 2WikiMultiHopQA
```

`script_process.py` reads:

```
ik_datasets/<DATASET>/datasets_raw_with_top_5_knowledge
```

and writes:

```
ik_datasets/<DATASET>/datasets_processed_with_top_5_knowledge
```
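`get_knowledge.py` handles retrieval end to end, but for manually probing the API server a small client sketch can help. The endpoint path `/search` and the payload fields `query` and `top_n` are assumptions for illustration; check `script_api.py` for the actual schema:

```python
# A hedged sketch of a client for the retrieval API on port 9001.
# Endpoint path and payload fields are assumed; see script_api.py.
import json
import urllib.request

def build_request(question: str, top_n: int = 5,
                  api_url: str = "http://localhost:9001") -> urllib.request.Request:
    """Build a POST request asking for the top-n knowledge for a question."""
    payload = json.dumps({"query": question, "top_n": top_n}).encode("utf-8")
    return urllib.request.Request(
        api_url + "/search",
        data=payload,
        headers={"Content-Type": "application/json"},
    )

def retrieve(question: str, top_n: int = 5,
             api_url: str = "http://localhost:9001"):
    """Send the request and decode the JSON response."""
    with urllib.request.urlopen(build_request(question, top_n, api_url)) as resp:
        return json.loads(resp.read())
```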
```shell
bash run_grpo.sh -p Qwen/Qwen2.5-3B-Instruct -m Qwen2.5-3B-Instruct -d 2WikiMultiHopQA -u <unique_id>
```

Training scripts expect `datasets/<DATASET>/processed_with_top_5_knowledge`. If needed, move the parquet files there or create a symlink.
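The symlink step can be scripted; a minimal sketch, where `link_processed` is a hypothetical helper and the paths follow the note above:

```python
# Symlink the processed data to where the training scripts expect it.
import os

def link_processed(src: str, dst: str) -> None:
    """Create dst as a symlink to src, skipping if dst already exists."""
    os.makedirs(os.path.dirname(dst), exist_ok=True)
    if not os.path.lexists(dst):
        os.symlink(os.path.abspath(src), dst)

# Example (paths from the note above):
# link_processed(
#     "ik_datasets/2WikiMultiHopQA/datasets_processed_with_top_5_knowledge",
#     "datasets/2WikiMultiHopQA/processed_with_top_5_knowledge",
# )
```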
See evaluation/README.md.
Some retrieval and training components build on the prior open-source project Graph-R1. Thanks to the community for releasing these building blocks.