FlagEval

智源研究院推出的FlagEval（天秤）大模型评测平台

1k 热度 AI模型评测

软件新闻

暂无相关新闻。

用于构建LLMs自定义评估和评分系统的AI平台。

Comprehensive platform for AI evaluation and observability.

Interactive platform for comparing AI model capabilities.

Streamline prompt evaluation for AI models.

Model evaluation and sharing made simple.