shivdev79/agent-boundary-v1
agent-boundary-v1
SUMMARY AI summary by gpt-5-mini
AgentBoundary-v1 は、企業向け自動化エージェントに「いつ行動すべきでないか」を学ばせるための OpenEnv 形式の強化学習環境です。対象は企業のAI安全担当者、RL/LLM研究者、プロダクションエージェント開発者で、判断(ACT / ASK / ESCALATE / REFUSE)を定量化・学習可能にします。 主な特徴は次の通りです。5つのタスク(易〜難、長期・敵対的・バッチ群)を備え、各ステップで意思決定に加え「可視証拠に基づく正当化」「適切なエスカレーション先」「質問の焦点」「選ぶツール」「構造化監査ノート」を出力させる点。報酬は8つの独立した決定論的コンポーネント(安全性、較正、情報収集、エスカレーション品質等)で構成され、ゲーミングを防ぐ設計です。OpenEnv 互換で、LLMベースの GRPO と単純な REINFORCE 線形ポリシーを想定した学習スタックが提供されます。目的は「判断」を可視化・測定・学習させ、実運用での過剰行動や過剰エスカレーションを減らすことです。
Language breakdown (by bytes)
Owner
Dates
| Created on GitHub | 2026-04-25 |
| Last push | 2026-05-09 |
| First seen here | 2026-05-09 |
| Last fetched | 2026-05-09 18:16 |
Similar repos (same language)
Deep learning framework for automated pneumonia detection from chest X-ray images using transfer learning, data augmentation, and ensemble-based medical image classification. Built with PyTorch using ResNet50, evaluation metrics, ROC analysis, confusion matrices, and visualization for reliable AI-assisted diagnosis.
Luffy-2520/infotact-project1-grievance-nlpAI-Powered Citizen Grievance & Sentiment Analysis System | NLP Project | Infotact Internship
ssprajapati2021/MLOpsAssignmentMLOps assignment from the Applied AI & Agentic AI program at IIITB with upGrad — covering model lifecycle, CI/CD pipelines, and deployment practices.
encoder-010/infotact-project1-grievance-nlpAI-Powered Citizen Grievance & Sentiment Analysis System | NLP Project | Infotact Internship
gururaj004/air-quality-index-analysisEDA and visualization project analyzing Air Quality Index (AQI) trends and pollution patterns using Python and Jupyter Notebook.
Yamuna-6730/adaptive-federated-idsAdaptive Privacy-Preserving Federated Intrusion Detection Framework for AI-Era Cybersecurity using LSTM, Differential Privacy, and Real-Time Threat Intelligence.
sara-7/artificial-intelligence-coursePractical AI course materials including labs, assignments, and hands-on implementations using Python.
Tejass1303/air-temperature-time-series-analysis