Claude Opus 4.6: This AI just passed the 'vending machine test' - and we may want to be worried about how it did
Teona Gherasim
When leading AI company Anthropic launched its latest AI model, Claude Opus 4.6, at the end of last week, the model broke records on many measures of intelligence and effectiveness - including one crucial benchmark: the vending machine test. Yes, AIs run vending machines now, under the watchful eyes of researchers at Anthropic and AI think tank Andon Labs. The idea is to test the AI's ability to coordinate multiple logistical and strategic challenges over a long period. As AI shifts from talking to