hijkzzz

Follow

hijkzzz

Follow

AI agent enthusiast, from RL to LLM, RLHF, MLSys.

186 followers · 50 following

https://hujian.website/

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Block or Report

Block or report hijkzzz

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

hijkzzz/README.md

🔭 I'm a Coding Lover.

Pinned

OpenLLMAI/OpenRLHF OpenLLMAI/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1.6k 136
pymarl2 pymarl2 Public

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python 573 112
alpha-zero-gomoku alpha-zero-gomoku Public

A Multi-threaded Implementation of AlphaZero

Python 356 48
cuda-neural-network cuda-neural-network Public

Convolutional Neural Network with CUDA (MNIST 99.23%)

C++ 165 38
deep-reinforcement-learning-notes deep-reinforcement-learning-notes Public

Deep Reinforcement Learning Notes

116 6
noisy-mappo noisy-mappo Public

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

Python 43 6