智源研究院推出的FlagEval(天秤)大模型评测平台
暂无相关新闻。
用于构建LLMs自定义评估和评分系统的AI平台。
Comprehensive platform for AI evaluation and observability.
Interactive platform for comparing AI model capabilities.
Streamline prompt evaluation for AI models.
Model evaluation and sharing made simple.