Custom Model#
LLM Model Evaluation#
Evaluation of large language models currently supports two methods: Native Evaluation and OpenCompass Evaluation.
Method 1: Using Native Evaluation#
See also
Method 2: Using OpenCompass Evaluation#
See also
VLM Model Evaluation#
Currently, evaluation of multimodal models only supports the VLMEvalKit backend.
Using VLMEvalKit Backend#
See also