EvasionBench: Detecting Evasive Answers in Financial Q&A via Multi-Model Consensus and LLM-as-Judge Paper • 2601.09142 • Published 13 days ago • 9
Eva-4B Collection Eva-4B: Financial Evasion Detection Model. This Hugging Face Collection groups the base Eva-4B model together with its GGUF releases. • 6 items • Updated 15 days ago • 2
DramaBench: A Six-Dimensional Evaluation Framework for Drama Script Continuation Paper • 2512.19012 • Published Dec 22, 2025 • 17