Sums It Up
Generative AI is absolutely terrible at summarizing information compared to humans, according to the findings of a trial for the Australian Securities and Investments Commission (ASIC) spotted by Australian outlet Crikey.
The trial, conducted by Amazon Web Services, was commissioned by the government regulator as a proof of concept for generative AI's capabilities, and in particular its potential to be used in business settings.
That potential, the trial found, is not looking promising.
In a series of blind assessments, the generative AI summaries of real government documents scored a dire 47 percent on aggregate based on the trial's rubric, and were decisively outdone by the human-made summaries, which scored 81 percent.
The findings echo a common theme in reckonings with the current spate of generative AI technology: not only are AI models a poor replacement for human workers, but their awful reliability means it's unclear if they'll have any practical use in the workplace for the majority of organizations.
Signature Shoddiness
The assessment used Meta's open source Llama2-70B, which isn't the newest model out there, but with 70 billion parameters, it's certainly a capable one.
The AI model was instructed to summarize documents submitted to a parliamentary inquiry, and specifically to focus on what was related to ASIC, such as where the organization was mentioned, and to include references and page numbers. Alongside the AI, human employees at ASIC were asked to write summaries of their own.
Then five evaluators were asked to assess the human and the AI-generated summaries after reading the original documents. The assessments were blind (the summaries were simply labeled A and B), and the scorers had no clue that AI was involved at all.
Or at least, they weren't supposed to. At the end, when the assessors had finished up and were told about the true nature of the experiment, three said that they suspected they were looking at AI outputs, which is pretty damning on its own.
Sucks On All Counts
All in all, the AI summaries scored lower than the human summaries on every criterion, the report said.
Strike one: the AI model was flat-out incapable of providing the page numbers of where it got its information.
That's something the report notes can be fixed with some tinkering with the AI model. But a more fundamental issue was that it regularly failed to pick up on nuance or context, and often made baffling choices about what to emphasize or highlight.
Beyond that, the AI summaries tended to include irrelevant and redundant information and were generally "waffly" and "wordy."
The upshot: these AI summaries were so bad that the assessors agreed that using them could require more work down the line, because of the amount of fact-checking they require. If that's the case, then the purported upsides of using the technology — cost-cutting and time-saving — are seriously called into question.
The post Government Test Finds That AI Wildly Underperforms Compared to Human Employees appeared first on Futurism.