A large language model evaluation benchmark released by the Fudan University NLP Lab.
An AI platform for building custom evaluation and scoring systems for LLMs.
Comprehensive platform for AI evaluation and observability.
Interactive platform for comparing AI model capabilities.
Streamline prompt evaluation for AI models.
Model evaluation and sharing made simple.