WhistleBuzz – Smart News on AI, Business, Politics & Global Trends
AI

Anthropic has to keep revising its technical interview test so candidates can’t cheat with Claude

By Editor-In-Chief, January 22, 2026


Since 2024, Anthropic’s Performance Optimization team has given job candidates a take-home test to check their skills. But as AI coding tools have improved, the test has had to change significantly to stay ahead of AI-assisted cheating.

Team leader Tristan Hume explained the history of the challenge in a blog post Wednesday. “Each time a new Claude model appeared, the tests had to be redesigned,” Hume writes. “Given the same time limit, Claude Opus 4 outperformed most human applicants. It was still able to distinguish the strongest candidates, but then Claude Opus 4.5 even matched those applicants.”

This creates a serious problem for evaluating candidates. Without in-person proctoring, there is no way to tell whether someone is using AI to cheat on the test, and anyone who does will quickly rise to the top. “Under the constraints of the take-home test,” Hume writes, “there was no longer any way to distinguish between the accomplishments of the best candidates and the most competent models.”

AI cheating is already wreaking havoc at schools and universities around the world, so it is ironic that AI labs now have to deal with it too. But Anthropic is also uniquely equipped to address the problem.

Ultimately, Hume designed a new test that had less to do with hardware optimization and was novel enough to stump current AI tools. As part of the post, however, he shared the original test to see whether any reader could come up with a better solution.

The post invites anyone who can outperform Opus 4.5 on the original test to get in touch.


