World news | 20Fix.com

A new method to steer AI output uncovers vulnerabilities and potential improvements

A team of researchers has found a way to steer the output of large language models by manipulating specific concepts inside these models. The new method could lead to more reliable, more efficient, and less computationally expensive training of LLMs. But it also exposes potential vulnerabilities.

play_arrow All United States of America News

2 w.

Technology

ID: 8435034979706414859

Similar News expand_more

7 d. New ensemble AI model enhances cyber intrusion detection with high accuracy

Technology

2 w. Indian AI lab Sarvam’s new models are a major bet on the viability of open source AI

Technology

4 w. Analysis: Deepfake Fraud Explodes as AI Tools Become Widely Accessible

Technology

2 w. Pinpointing direction in noisy 2D data: New algorithm could improve imaging, AI, particle research and more

Technology

9 d. AI is getting smarter, but not wiser: A new roadmap aims to fix that gap

Technology

2 w. Indian AI lab Sarvam’s new models are a major bet on the viability of open

Technology

1 w. Perplexity’s new Computer is another bet that users need many AI models

Technology

2 w. AI model edits can leak sensitive data via update 'fingerprints'

Technology

1 M. New framework pinpoints conditions that make data augmentation improve robustness

Technology

1 M. 3 Questions: Using AI to accelerate the discovery and design of therapeutic drugs

Science

1 M. HHS Is Making an AI Tool to Create Hypotheses About Vaccine Injury Claims

Science

2 w. Research project launches free tool to make AI safer and more trustworthy

Technology

5 d. How our AI bots are ignoring their programming and giving hackers superpowers

Technology

5 d. DiligenceSquared uses AI, voice agents to make M&A research affordable

Technology

1 M. New hazards to be analyzed in Alaska’s updated statewide threat assessment

Weather

1 M. Helping AI agents search to get the best results out of large language models

Technology

2 w. Researchers develop a system that detects subtle defects missed by existing industrial visual inspection

Science

2 w. Security vulnerabilities in Tesla's Model 3 and Cybertruck reveal how connected cars can be hacked

Automotive

2 w. AI energy use: New tools show which model consumes the most power, and why

Technology

2 w. This software engineer pivoted to an AI role. Here's what helped him make the change.

Technology

1 M. Is artificial general intelligence already here? A new case that today's LLMs meet key tests

Technology

5 d. New insights into a hidden process that protects cells from harmful mutations

Science

2 w. Jailbreaking the matrix: How researchers are bypassing AI guardrails to make them safer

Automotive

6 d. A “ChatGPT for spreadsheets” helps solve difficult engineering challenges faster

Technology

6 d. 'ChatGPT for spreadsheets' helps solve difficult engineering challenges faster

Technology

14 h. AI network startup Eridu emerges from stealth with hefty $200M Series A

Technology

5 d. FBI investigating hack on its wiretap and surveillance systems: Report

Crime

2 w. AI agents have their own social network: Moltbook study tracks topics and toxicity

Science

2 w. Google’s Cloud AI leads on the three frontiers of model capability

Technology

1 M. Understanding the hazard potential of the Seattle fault zone: It's 'pretty close to home'

Weather

7 h. Lurking dementia risk exposed by breakthrough test 25 years before symptoms

Science

2 w. Google’s Cloud AI lead on the three frontiers of model capability

Technology

1 M. AI is failing 'Humanity's Last Exam'—so what does that mean for machine intelligence?

Science

4 w. Vega raises $120M Series B to rethink how enterprises detect cyber threats

Technology

4 d. Anthropic’s Claude found 22 vulnerabilities in Firefox over two weeks

Science

2 w. From automated farm tractors to exam paper grading, AI boosts efficiency for some in India

Technology

3 w. Investigators Hope DNA Testing Can Provide Breakthrough in Guthrie Case

Science

27 h. AI and work: An expert assesses how far this revolution still has to run

Technology

3 w. Replay: Vision 2026: Predicting The Next Major Changes In Crypto (Video)

Entertainment

12 h. Researchers put six AI agents on Discord for two weeks, exposing risky failures

Science

7 d. A suite of government hacking tools targeting iPhones is now being used by cybercriminals

Technology

4 w. AIDEA releases 'independent' and 'unbiased' analysis, revised by contractor to please AIDEA

Technology

1 w. Exclusive: U.S. must overhaul military readiness and tech metrics, report urges

Military

8 d. Innovaccer Receives Frost & Sullivan's 2026 United States New Product Innovation Recognition for Excellence in AI

Technology

2 w. Nancy Guthrie disappearance: Former FBI agent reveals amount of time likely needed for advanced DNA testing

Science

5 d. Study proposes ways to control unforeseen leaks in underground excavations

Technology

5 h. Can AI read papers like a scientist? A new benchmark shows where LLMs fail

Science

3 d. What does the US military’s feud with Anthropic mean for AI used in war? | AI (artificial intelligence)

Military

2 w. Opinion | The FDA’s damage to medical innovation will be hard to repair

Science