Archived Test Results
These results are from earlier test suites that have been superseded by our Enhanced Professional Suite v2.0. They are kept for historical reference but may not reflect current model capabilities. For the most accurate and comprehensive results, see the Professional Test Results.
Complete historical test data including all test suites
Meta
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Meta
Meta
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Meta
Cohere
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Cohere
Cohere
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Cohere
Cohere
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Cohere
Cohere
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Cohere
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Mistral AI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
Anthropic
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
Anthropic
OpenAI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
OpenAI
OpenAI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
OpenAI
OpenAI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
OpenAI
OpenAI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
OpenAI
Anthropic
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
Anthropic
Category Scores
PII Leakage
Logic Traps
Structural Stress
Linguistic Obfuscation
Cognitive Noise
Social Engineering
Anthropic
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
Anthropic
Category Scores
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
OpenAI
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
OpenAI
Category Scores
Structural Stress
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
OpenAI
Overall Score
Tests Run
111
Total Available
Last Tested
Provider
OpenAI
Category Scores
Safety
Jailbreak
Bias
Privacy
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Structural Stress
Logic Traps
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
Category Scores
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
Overall Score
Tests Run
100
Total Available
Last Tested
Provider
Category Scores
Safety
Jailbreak
Bias
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
Anthropic
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
Anthropic
Category Scores
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
Anthropic
Overall Score
Tests Run
90
Total Available
Last Tested
Provider
Anthropic
Category Scores
Bias
Safety
Jailbreak
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
Linguistic Obfuscation
OpenAI
Overall Score
Tests Run
106
Total Available
Last Tested
Provider
OpenAI
Category Scores
Safety
Jailbreak
Bias
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Logic Traps
Structural Stress
OpenAI
Overall Score
Tests Run
106
Total Available
Last Tested
Provider
OpenAI
Category Scores
Safety
Jailbreak
Bias
Linguistic Obfuscation
Cognitive Noise
Social Engineering
PII Leakage
Structural Stress
Logic Traps
Overall Score
Tests Run
94
Total Available
Last Tested
Provider
Category Scores
Logic Traps
Cognitive Noise
Social Engineering
PII Leakage
Linguistic Obfuscation
Structural Stress
Anthropic
Overall Score
Tests Run
106
Total Available
Last Tested
Provider
Anthropic
Category Scores
PII Leakage
Cognitive Noise
Social Engineering
Structural Stress
Logic Traps
Linguistic Obfuscation
OpenAI
Overall Score
Tests Run
0
Total Available
Last Tested
Provider
OpenAI
OpenAI
Overall Score
Tests Run
17
Total Available
Last Tested
Provider
OpenAI
Category Scores
Safety
Bias Detection