← ランキング · AI関連リポジトリ

bhargav141223/warehouse-multi-agent-rl-using-mappo

A comprehensive full-stack application for multi-agent warehouse navigation using Multi-Agent Proximal Policy Optimization (MAPPO) with Large Language Model (LLM) reward shaping and Retrieval-Augmented Generation (RAG) memory.

GitHub ↗

★ 0

stars

AI関連スコア

個人開発度

AIツール痕跡

SUMMARY AI要約 by gpt-5-mini

A codebase for multi-agent warehouse navigation that trains cooperative policies with Multi-Agent Proximal Policy Optimization (MAPPO) enhanced by LLM-based reward shaping and a Retrieval-Augmented Generation (RAG) memory layer. Intended for researchers and engineers working on multi-agent RL, warehouse automation, and human-in-the-loop reward design. Key features: - Simulated multi-agent warehouse environment and task scenarios - MAPPO training and evaluation pipelines for synchronized cooperative policies - LLM-in-the-loop reward shaping to incorporate high-level guidance into scalar rewards - RAG memory to store and retrieve episodic/contextual information for agents - Utilities for checkpointing, evaluation, and visualizing agent behavior

オーナー情報

アカウント

bhargav141223

タイプ

User

フォロワー

日付

GitHub作成日	2026-05-09
最終Push	2026-05-09
当サイト初検出	2026-05-09
最終取得	2026-05-09 18:14