Model Inference Stress Testing

A stress testing tool for large language models. It can be customized to support different dataset formats and API protocols, and it supports the OpenAI API format by default.

  • Quick Start
    • Environment Preparation
    • Basic Usage
  • Parameter Description
    • Basic Settings
    • Network Configuration
    • Request Control
    • Prompt Settings
    • Dataset Configuration
    • Model Settings
    • Data Storage
  • Examples
    • Using Local Model Inference
    • Using prompt
    • Complex Requests
    • Using query-template
    • Using wandb to Record Test Results
    • Debugging Requests
  • Speed Benchmark Testing
    • Online API Inference
    • Local Transformer Inference
    • Local vLLM Inference
  • Custom Usage
    • Custom Result Analysis
    • Custom Request API
    • Custom Dataset

© 2022-2024, Alibaba ModelScope