🤖 Model Performance Comparison Tool

Compare LLM performance on multiple-choice questions using Hugging Face models.

Format: each line should contain Question, Correct Answer, Choice1, Choice2, Choice3

💡 Features:

  • Model evaluation using HuggingFace transformers
  • Support for custom models via HF model paths
  • Detailed question-by-question results
  • Performance charts and statistics

Choose a sample dataset or enter your own.

Format Requirements:

  • First line: header row (ignored); leave it empty if your file has no header
  • Each data line: Question, Correct Answer, Choice1, Choice2, Choice3
  • Use commas or tabs as separators (see the example below)
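
A hypothetical input file in this format (the questions are illustrative; the correct answer comes right after the question, followed by three distractors):

```
Question,Correct Answer,Choice1,Choice2,Choice3
What is the capital of France?,Paris,London,Berlin,Madrid
What is 2 + 2?,4,3,5,22
```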

Select from the list of popular models, or enter a custom HF model path.

โš ๏ธ Note:

  • Larger models require more memory; this tool currently runs on CPU only
  • The first run downloads model weights (this may take a while)
  • Models are cached for subsequent runs (see the sketch below)
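
For reference, the transformers library caches downloaded weights (by default under ~/.cache/huggingface), which is what makes repeat runs fast. A minimal sketch of redirecting that cache via the cache_dir argument of from_pretrained; the model name and path are illustrative:

```python
from transformers import AutoModelForCausalLM

# Weights download once and are reused on later runs; cache_dir moves the
# cache to another location, e.g. a larger disk.
model = AutoModelForCausalLM.from_pretrained("distilgpt2", cache_dir="/data/hf-cache")
```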

About Model Evaluation

This tool loads and runs HuggingFace models for evaluation:

๐Ÿ—๏ธ How it works:

  • Downloads models from the Hugging Face Hub
  • Formats each question as a prompt for the model
  • Runs likelihood-based evaluation, scoring each answer choice (see the sketch below)
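
A minimal sketch of this pipeline, assuming the transformers and torch libraries; the prompt template, model name, and scoring details here are illustrative, not the tool's exact implementation:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "distilgpt2"  # illustrative; any causal LM on the Hub should work
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()

def choice_log_likelihood(question: str, choice: str) -> float:
    """Sum of the log-probabilities the model assigns to the choice tokens."""
    prompt = f"Question: {question}\nAnswer:"
    prompt_len = tokenizer(prompt, return_tensors="pt").input_ids.shape[1]
    full_ids = tokenizer(prompt + " " + choice, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    # logits at position i predict token i+1, so shift by one when indexing targets.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = full_ids[0, prompt_len:]
    positions = range(prompt_len - 1, full_ids.shape[1] - 1)
    return sum(log_probs[p, t].item() for p, t in zip(positions, targets))

def predict(question: str, choices: list[str]) -> str:
    """Pick the answer choice with the highest likelihood under the model."""
    scores = [choice_log_likelihood(question, c) for c in choices]
    return choices[scores.index(max(scores))]

print(predict("What is the capital of France?", ["Paris", "London", "Berlin", "Madrid"]))
```

The predicted choice is then compared against the correct answer to compute per-question results and overall accuracy.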

⚡ Performance Tips:

  • Use smaller models for testing (see the sketch below)
  • Larger models (7B+ parameters) require significant memory
  • Models are cached after the first load
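
A small sketch of a memory-conscious load, assuming the low_cpu_mem_usage option of from_pretrained (the model name is illustrative):

```python
from transformers import AutoModelForCausalLM

# Start with a small model for smoke tests and swap in larger ones later;
# low_cpu_mem_usage streams weights in to reduce peak RAM during loading.
model = AutoModelForCausalLM.from_pretrained("distilgpt2", low_cpu_mem_usage=True)
```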

🔧 Supported Models:

  • Any HuggingFace autoregressive language model
  • Both instruction-tuned and base models
  • Custom fine-tuned models via HF paths (see the loading sketch below)
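
As a sketch, custom models load the same way as the built-in choices: from_pretrained accepts either a Hub repo ID or a local directory (the path below is a placeholder, not a real model):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path: substitute your own Hub repo ID or local checkpoint directory.
path = "your-username/your-finetuned-model"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(path)
```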