World news | 20Fix.com

home All News open_in_new Full Article

This Week in AI: Maybe we should ignore AI benchmarks for now

Welcome to TechCrunch’s regular AI newsletter! We’re going on hiatus for a bit, but you can find all our AI coverage, including my columns, our daily analysis, and breaking news stories, at TechCrunch. If you want those stories and much more in your inbox every day, sign up for our daily newsletters here. This week, billionaire […] © 2024 TechCrunch. All rights reserved. For personal use only.

The article discusses the recent release of xAI's Grok 3 AI model by Elon Musk, which outperforms other models in specific benchmarks. It critiques the reliance on AI benchmarks, arguing they often measure narrow, esoteric tasks rather than practical utility. The piece highlights the need for better, more independent testing methods and suggests focusing on real-world impact rather than benchmark results. It also mentions other AI developments, such as OpenAI's SWE-Lancer benchmark and new models from Chinese companies, while emphasizing the need for more meaningful evaluation criteria.

today 45 h. ago attach_file Politics

									
												2 h. ago											Noem makes aggressive new move to ramp up arrests, deportations of illegal immigrants
												
													DHS Secretary Kristi Noem signed a memo this week that deputizes up to 600 State Department officials to act as immigration officers, according to the department.												
												
												Events												
											
									attach_file
									Events

									
												5 h. ago											CWG Live updates: Less cold but still gusty today; milder weekend into next week
												
													CWG Live updates: Less cold but still gusty today; milder weekend into next week  The Washington Post												
												
												Politics												
											
									attach_file
									Politics

									
												5 h. ago											In case you missed it: Here's what happened with Trump this week, from Ukraine to DOGE
												
													NPR rounds up what happened this week, the fourth week of President Trump's administration, and takes a look at some developments that have been overlooked.												
												
												Politics												
											
									attach_file
									Politics

									
												5 h. ago											Did you follow the local news this week? Take our Greater Boston news quiz.
												
													Test your knowledge of pizza preeminence, a Wahlberg win, and the marathon medal in this week’s news quiz. The post Did you follow the local news this week? Take our Greater Boston news quiz. appeared first on Boston.com.												
												
												Politics												
											
									attach_file
									Politics

									
												5 h. ago											From Ukraine to DOGE: Here's everything that happened with Trump this week
												
													NPR rounds up what happened this week, the fourth week of President Trump's administration, and takes a look at some developments that have been overlooked.												
												
												Politics												
											
									attach_file
									Politics

									
												5 h. ago											Conscription on the Ballot for Germany's National Elections This Week
												
													The outcome of the German Federal elections this week will likely determine whether conscription will return to the country as the government finds itself thousands of soldiers short of what it thinks it needs. The post Conscription on the Ballot for Germany’s National Elections This Week appeared first on Breitbart.												
												
												Politics												
											
									attach_file
									Politics

									
												6 h. ago											A new fellowship gives 20 older jazz musicians $100,000 each — no strings attached
												
													This week, a new fellowship was announced that granted twenty jazz musicians of retirement age a gift of $100,000 each.												
												
												Politics												
											
									attach_file
									Politics

									
												6 h. ago											Luigi Mangione's CEO murder case raises concerns activist jurors may ignore evidence
												
													Legal experts say voir dire, or jury selection, could be the key to deciding the fate of suspected UnitedHealthcare CEO assassin Luigi Mangione.												
												
												Events												
											
									attach_file
									Events

									
												7 h. ago											Pope marks 1 week in hospital with pneumonia: Might he resign?
												
													Pope Francis has spent one week in the hospital fighting pneumonia												
												
												Politics												
											
									attach_file
									Politics

									
												12 h. ago											Eye health, long covid and a stress hormone: The week in Well+Being
												
													Eye health, long covid and a stress hormone: The week in Well+Being  The Washington Post												
												
												Science												
											
									attach_file
									Science

									
												14 h. ago											Stephen Miller calls out WH press for ignoring Biden's decline, gives 'civics lesson' on presidential power
												
													White House deputy chief of staff Stephen Miller answered a question about people insisting Elon Musk is an “unelected bureaucrat" in the Trump administration.												
												
												Politics												
											
									attach_file
									Politics

									
												15 h. ago											What Trump’s order ignores about the Presidio: It’s a gem of government efficiency
												
													What Trump’s order ignores about the Presidio: It’s a gem of government efficiency  San Francisco Chronicle												
												
												Politics												
											
									attach_file
									Politics

									
												17 h. ago											Elephant seals, fog harvesting and the brain science behind sugar cravings
												
													This week's Short Wave news roundup covers harvesting drinking water from fog, what elephant seals reveal about fish populations in the deep ocean, and why there's always room for dessert.												
												
												Politics												
											
									attach_file
									Politics

									
												17 h. ago											USS Harry S. Truman commanding officer relieved after collision with merchant ship near Suez Canal
												
													The commanding officer of the U.S. Navy aircraft carrier USS Harry S. Truman was relieved of his duty on Thursday, just over a week after the vessel collided with a merchant ship.												
												
												Events												
											
									attach_file
									Events

									
												21 h. ago											DOGE: Feds Spent $40 Billion on 4.6 Million Credit Cards in FY2024
												
													The Department of Government Efficiency (DOGE) is working to "simplify" the federal government's credit card program — which spent nearly $40 billion in Fiscal Year 2024 — announcing they will "report back" in one week after working to "reduce admin costs." The post DOGE: Feds Spent $40 Billion on 4.6 Million Credit Cards in FY2024 appeared first on Breitbart.												
												
												Economics												
											
									attach_file
									Economics

									
												22 h. ago											Meghan Markle Slammed for Renaming Her Lifestyle Brand After a Beloved Small Clothing Business
												
													Meghan Markle engendered backlash this week by choosing a lifestyle brand named after a small business.  The post Meghan Markle Slammed for Renaming Her Lifestyle Brand After a Beloved Small Clothing Business appeared first on Breitbart.												
												
												Politics												
											
									attach_file
									Politics

									
												22 h. ago											WATCH: Hamish the Scottish Highland calf gets the zoomies at Nashville Zoo
												
													The seven-week-old Scottish Highland calf at the Nashville Zoo seemed light on his feet during his first day in public.												
												
												Politics												
											
									attach_file
									Politics

									
												25 h. ago											NFL projects salary cap between $277.5M and $281.5M in 2025, rising by as much as $26M
												
													The final cap figure is expected to be finalized next week. The post NFL projects salary cap between $277.5M and $281.5M in 2025, rising by as much as $26M appeared first on Boston.com.												
												
												Economics												
											
									attach_file
									Economics

									
												27 h. ago											Europe Sending Peacekeeper Troops to Ukraine 'Unacceptable', Says Kremlin
												
													Britain is set to pitch a plan for thousands of NATO soldiers in Ukraine this coming week, a notion Russia says constitutes "direct threat".  The post Europe Sending Peacekeeper Troops to Ukraine ‘Unacceptable’, Says Kremlin appeared first on Breitbart.												
												
												Politics												
											
									attach_file
									Politics

									
												28 h. ago											Navy SEAL who killed Bin Laden recalls epiphany about Redskins before mission: 'I'm gonna be dead next week'
												
													Former SEAL Team 6 member Robert O'Neill said he was stressed about the Washington Redskins' NFL Draft selection despite thinking he would soon be dead during the operation to kill Usama bin Laden.												
												
												Events												
											
									attach_file
									Events

ID: 216694662