A large language model evaluation framework developed by Stanford University.
An AI platform for building custom evaluation and scoring systems for LLMs.
Comprehensive platform for AI evaluation and observability.
Interactive platform for comparing AI model capabilities.
Streamline prompt evaluation for AI models.
Model evaluation and sharing made simple.