Is AI Benchmarking Broken? The Truth Behind "con@64" Revealed Brought to you by Avonetics.com
Feb 20 2025
Length: 9 mins
Podcast

Summary
Discover the controversial "con@64" technique, in which an AI model is prompted 64 times on the same question and the consensus answer is scored. Is this a legitimate way to reduce variance or a sneaky trick to inflate benchmark scores? Dive into the heated debate on whether this practice skews real-world performance comparisons and unfairly shapes perceptions of model capabilities. Learn why some accuse xAI engineers of overhyping AI and how comparing scores reported at different "con" values could be misleading the industry. For advertising opportunities, visit Avonetics.com.
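
To make the idea concrete, here is a minimal sketch of consensus voting over repeated samples, assuming the "consensus" is a simple plurality vote over the 64 answers. The function names, the toy noisy model, and its 60% single-shot accuracy are illustrative assumptions, not details from the episode or from any lab's evaluation harness.

```python
import random
from collections import Counter
from typing import Callable


def consensus_at_k(sample_answer: Callable[[], str], k: int = 64) -> str:
    """Ask the model k times and return the most frequent answer."""
    answers = [sample_answer() for _ in range(k)]
    return Counter(answers).most_common(1)[0][0]


def make_noisy_model(correct: str, p_correct: float,
                     rng: random.Random) -> Callable[[], str]:
    """Hypothetical stand-in for a model: right with probability p_correct,
    otherwise one of three wrong answers chosen uniformly."""
    wrong = ["wrong-a", "wrong-b", "wrong-c"]

    def sample() -> str:
        return correct if rng.random() < p_correct else rng.choice(wrong)

    return sample


def compare_scores(trials: int = 200, k: int = 64,
                   p_correct: float = 0.6, seed: int = 0) -> tuple[float, float]:
    """Return (single-shot accuracy, consensus-at-k accuracy) on toy questions."""
    rng = random.Random(seed)
    model = make_noisy_model("right", p_correct, rng)
    single = sum(model() == "right" for _ in range(trials)) / trials
    consensus = sum(consensus_at_k(model, k) == "right"
                    for _ in range(trials)) / trials
    return single, consensus


if __name__ == "__main__":
    single, consensus = compare_scores()
    print(f"single-shot accuracy: {single:.2f}")
    print(f"consensus@64 accuracy: {consensus:.2f}")
```

In this toy setup the same model scores noticeably higher under consensus voting than on a single attempt, which is why a score reported at one "con" value is hard to compare with a single-attempt score or with a score reported at a different value.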