“LMArena is a cancer”: How LLM rankings distort the AI sector
Tehnologie
Anyone who wants to get a quick overview of how good (or bad) new AI models from OpenAI, xAI, Google, Anthropic, DeepSeek and many other companies are has several options. Either you believe the PR statements from the companies, which like to highlight selected test results and market themselves as world-class. Or you consult different web services like LMArena (recently valued at 1.7 billion dollars, more on that here ), Artificial Analysis or OpenRouter , each of which has its own evaluation
din zilele anterioare