Close Menu
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
What's Hot

Runway started by supporting filmmakers. Now they are trying to beat Google with AI.

May 15, 2026

Inflation expected to reach 6% in second quarter, top economic forecasters say

May 15, 2026

Prediction markets – retail investors have a new “toy” for speculation

May 15, 2026
Facebook X (Twitter) Instagram
Smart Breaking News on AI, Business, Politics & Global Trends | WhistleBuzz
Facebook X (Twitter) Instagram
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
Smart Breaking News on AI, Business, Politics & Global Trends | WhistleBuzz
Home » Anthropic must continue to revise their technical interview tests so Claude can’t cheat
AI

Anthropic must continue to revise their technical interview tests so Claude can’t cheat

Editor-In-ChiefBy Editor-In-ChiefJanuary 22, 2026No Comments2 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email


Since 2024, Anthropic’s Performance Optimization team has been giving job candidates take-home tests to check their knowledge. But as AI coding tools have improved, testing has had to change significantly to stay ahead of AI-powered fraud.

Team leader Tristan Hume explained the history of the challenge in a blog post Wednesday. “Each time a new Claude model appeared, the tests had to be redesigned,” Hume writes. “Given the same time limit, Claude Opus 4 outperformed most human applicants. It was still able to distinguish the strongest candidates, but then Claude Opus 4.5 even matched those applicants.”

This results in serious problems in evaluating candidates. Without in-person proctoring, there is no way to tell if someone is using AI to cheat on an exam. If a person cheats, he or she will quickly rise to the top. “Under the constraints of the take-home test,” Hume writes, “there was no longer any way to distinguish between the accomplishments of the best candidates and the most competent models.”

The issue of AI cheating is already causing havoc in schools and universities around the world, so it’s ironic that AI labs are also having to deal with it. But Anthropic is also uniquely equipped to address this issue.

Ultimately, Hume designed a new test that had less to do with hardware optimization and was novel enough to overwhelm modern AI tools. However, as part of the post, he shared his original test to see if anyone reading could come up with a better solution.

“If you can achieve Opus 4.5, we’d love to hear from you,” the post reads.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Editor-In-Chief
  • Website

Related Posts

Runway started by supporting filmmakers. Now they are trying to beat Google with AI.

May 15, 2026

Osaurus brings both local and cloud AI models to Mac

May 15, 2026

What the jury will actually decide in the Elon Musk vs. Sam Altman case

May 14, 2026
Add A Comment

Comments are closed.

News

Trump-Xi summit: China and the US disagree on the content of the agreement | Business and economic news

By Editor-In-ChiefMay 15, 2026

US President Donald Trump departed China on Friday after a two-day summit with Chinese President…

After the Beijing summit, President Trump and President Xi shift to a relationship that prioritizes business | Xi Jinping News

May 15, 2026

How the summit between Mr. Xi and President Trump did not result in a breakthrough in the Iran war | Donald Trump News

May 15, 2026
Top Trending

Runway started by supporting filmmakers. Now they are trying to beat Google with AI.

By Editor-In-ChiefMay 15, 2026

AI video generation startup Runway doesn’t have a typical Silicon Valley pedigree.…

Osaurus brings both local and cloud AI models to Mac

By Editor-In-ChiefMay 15, 2026

As AI models become increasingly commoditized, startups are racing to build a…

What the jury will actually decide in the Elon Musk vs. Sam Altman case

By Editor-In-ChiefMay 14, 2026

A nine-member jury in California is currently deliberating the future of OpenAI,…

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Welcome to WhistleBuzz.com (“we,” “our,” or “us”). Your privacy is important to us. This Privacy Policy explains how we collect, use, disclose, and safeguard your information when you visit our website https://whistlebuzz.com/ (the “Site”). Please read this policy carefully to understand our views and practices regarding your personal data and how we will treat it.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • Advertise With Us
  • Contact US
  • DMCA Policy
  • Privacy Policy
  • Terms & Conditions
  • About US
© 2026 whistlebuzz. Designed by whistlebuzz.

Type above and press Enter to search. Press Esc to cancel.