저장소

SantanderAI/autoguardrails

Alignment-research scaffold (autoresearch-style) for LLM guardrails: search over a single policy.md surface

Stars
115
Forks
31
Issues
2
Updated
7월 1일
Language
Python
License
Apache-2.0
#ai#ai-safety#alignment#autoresearch#benchmark#content-moderation#evaluation#guardrails
GitHub 열기 ↗