LLM4Band

This repository contains the source code for the paper "LLM4Band: Enhancing Reinforcement Learning with Large Language Models for Accurate Bandwidth Estimation".

How to use?

Offline training

Download the dataset from RL4BandwidthEstimationChallenge, download the pre-trained model from huggingface(gpt2, t5, qwen).
Split the dataset and preprocess the data (pickle format).
Replace the model path in the code, train model: run IQL.py.

Offline testing

Prepare offline testing scenario in validation/prepare_scenario, evaluate the model in validation/evaluate.

Online application

Test environment: AlphaRTC
Download link for the docker image: alphartc4band
Download link for the test media: testmedia

Limit port traffic, run:

  modprobe sch_netem

  modprobe sch_htb

  docker run --rm -it -v $(pwd)/LLM4Band:/app -w /app -e PYTHONPATH=/usr/lib/python3/dist-packages --name alphartc4band --cap-add=NET_ADMIN alphartc4band

Entering the container, run:

    sudo /root/go/bin/comcast --device lo --target-port 8000 --target-bw 200 --latency 50 --packet-loss 1

    peerconnection_serverless receiver_pyinfer.json

Stop:
```
    comcast --device lo --stop
```

Perform the test task in another terminal：

  docker exec alphartc4band peerconnection_serverless sender_pyinfer.json

Calculate the score：

    docker run --rm -v `pwd`/LLM4Band:/app -w /app/metrics --name eval alphartc4band python3 eval_network.py --dst_network_log /app/logging/webrtc.log --output /app/result/out_eval_network.json --ground_recv_rate 500 --max_delay 500

Citation

@inproceedings{wang2025llm4band,
  title={LLM4Band: Enhancing Reinforcement Learning with Large Language Models for Accurate Bandwidth Estimation},
  author={Wang, Zhijian and Lu, Rongwei and Zhang, Zhiyang and Westphal, Cedric and He, Dongbiao and Jiang, Jingyan},
  booktitle={Proceedings of the 35th Workshop on Network and Operating System Support for Digital Audio and Video},
  pages={43--49},
  year={2025}
}

Acknowledgments

RL4BandwidthEstimationChallenge - dataset
AlphaRTC - simulation platform
NAORL, CORL, HuggingFace - tools
BoB, Schaferct, HRCC, Pioneer - baselines

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Application/LLM4Band		Application/LLM4Band
GPT2_for_bwe		GPT2_for_bwe
Pioneer/onnx_model		Pioneer/onnx_model
Qwen_for_bwe		Qwen_for_bwe
Schaferct		Schaferct
T5_for_bwe		T5_for_bwe
dataset		dataset
dataset_analysis		dataset_analysis
download-dataset		download-dataset
mlp		mlp
validation		validation
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM4Band

How to use?

Offline training

Offline testing

Online application

Citation

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

WzjCoder/LLM4Band

Folders and files

Latest commit

History

Repository files navigation

LLM4Band

How to use?

Offline training

Offline testing

Online application

Citation

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages