Games News | Slashdot

Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark (techcrunch.com) 10

Posted by msmash on Thursday May 01, 2025 @09:00AM from the jig-is-up dept.

Duolingo Doubles Its Language Courses Thanks To AI 51

Posted by BeauHD on Thursday May 01, 2025 @06:00AM from the would-you-look-at-that dept.

Microsoft Puts Brakes on AI Spending as Profit Increases 18% 7

Posted by BeauHD on Wednesday April 30, 2025 @07:40PM from the putting-on-the-brakes dept.

Google Funding Electrician Training As AI Power Crunch Intensifies 34

Posted by BeauHD on Wednesday April 30, 2025 @05:40PM from the national-shortage dept.

Microsoft CEO Says Up To 30% of the Company's Code Was Written by AI (techcrunch.com) 149

Posted by msmash on Wednesday April 30, 2025 @01:34PM from the new-world-order dept.

Wikipedia To Use AI (wikimediafoundation.org) 40

Posted by msmash on Wednesday April 30, 2025 @11:20AM from the how-about-that dept.

Gen AI Is Not Replacing Jobs Or Hurting Wages At All, Say Economists 108

Posted by BeauHD on Wednesday April 30, 2025 @09:00AM from the would-you-look-at-that dept.

An anonymous reader quotes a report from The Register: Instead of depressing wages or taking jobs, generative AI chatbots like ChatGPT, Claude, and Gemini have had almost no wage or labor impact so far -- a finding that calls into question the huge capital expenditures required to create and run AI models. In a working paper released earlier this month, economists Anders Humlum and Emilie Vestergaard looked at the labor market impact of AI chatbots on 11 occupations, covering 25,000 workers and 7,000 workplaces in Denmark in 2023 and 2024.

Many of these occupations have been described as being vulnerable to AI: accountants, customer support specialists, financial advisors, HR professionals, IT support specialists, journalists, legal professionals, marketing professionals, office clerks, software developers, and teachers. Yet after Humlum, assistant professor of economics at the Booth School of Business, University of Chicago, and Vestergaard, a PhD student at the University of Copenhagen, analyzed the data, they found the labor and wage impact of chatbots to be minimal. "AI chatbots have had no significant impact on earnings or recorded hours in any occupation," the authors state in their paper.

The report should concern the tech industry, which has hyped AI's economic potential while plowing billions into infrastructure meant to support it. Early this year, OpenAI admitted that it loses money per query even on its most expensive enterprise SKU, while companies like Microsoft and Amazon are starting to pull back on their AI infrastructure spending in light of low business adoption past a few pilots. The problem isn't that workers are avoiding generative AI chatbots -- quite the contrary. But they simply aren't yet equating to actual economic benefits. "The adoption of these chatbots has been remarkably fast," Humlum told The Register. "Most workers in the exposed occupations have now adopted these chatbots. Employers are also shifting gears and actively encouraging it. But then when we look at the economic outcomes, it really has not moved the needle."

Humlum said while there are gains and time savings to be had, "there's definitely a question of who they really accrue to. And some of it could be the firms -- we cannot directly look at firm profitability. Some of it could also just be that you save some time on existing tasks, but you're not really able to expand your output and therefore earn more. So it's like it saves you time writing emails. But if you cannot really take on more work or do something else that is really valuable, then that will put a damper on how much we should actually expect those time savings to affect your earning ability, your total hours, your wages."

"In terms of economic outcomes, when we're looking at hard metrics -- in the administrative labor market data on earnings, wages -- these tools have really not made a difference so far," said Humlum. "So I think that that puts in some sense an upper bound on what return we should expect from these tools, at least in the short run. My general conclusion is that any story that you want to tell about these tools being very transformative, needs to contend with the fact that at least two years after [the introduction of AI chatbots], they've not made a difference for economic outcomes."

Google Play Sees 47% Decline In Apps Since Start of Last Year (techcrunch.com) 69

Posted by BeauHD on Tuesday April 29, 2025 @08:50PM from the compare-and-contrast dept.

Google Play's app marketplace has seen a dramatic 47% drop in available apps -- from 3.4 million to 1.8 million -- since the start of 2024. An analysis by app intelligence provider Appfigures attributes the decline to stricter quality standards, expanded human reviews, and increased enforcement against low-quality and deceptive apps. TechCrunch reports: In July 2024, Google announced it would raise the minimum quality requirements for apps, which may have impacted the number of available Play Store app listings.

Instead of only banning broken apps that crashed, wouldn't install, or run properly, the company said it would begin banning apps that demonstrated "limited functionality and content." That included static apps without app-specific features, such as text-only apps or PDF file apps. It also included apps that provided little content, like those that only offered a single wallpaper. Additionally, Google banned apps that were designed to do nothing or have no function, which may have been tests or other abandoned developer efforts.

Reached for comment, Google confirmed that its new policies were factors here, which also included an expanded set of verification requirements, required app testing for new personal developer accounts, and expanded human reviews to check for apps that try to deceive or defraud users. In addition, the company pointed to other 2024 investments in AI for threat detection, stronger privacy policies, improved developer tools, and more. As a result, Google prevented 2.36 million policy-violating apps from being published on its Play Store and banned more than 158,000 developer accounts that had attempted to publish harmful apps, it said. TechCrunch also notes that a new trader status rule, which went into effect in the EU this February, could be another contributing factor. It requires developers to display their names and addresses in their app listings, and failure to comply would see their apps removed from EU app stores.

OpenAI's o3 Model Beats Master-Level Geoguessr Player 32

Posted by BeauHD on Tuesday April 29, 2025 @06:10PM from the not-too-shabby dept.

In a blog post yesterday, Master I-ranked human GeoGuessr player Sam Patterson said that OpenAI's o3 model outscored him in a head-to-head match, "correctly identifying all five countries and twice landing within a few hundred meters." Geoguessing is a game -- most popularly known through the platform GeoGuessr -- where players are dropped into a random location in Google Street View and must figure out where in the world they are using only visual clues from the environment. With the release of its newest AI models, o3 and o4-mini, OpenAI now does a surprisingly good job of analyzing uploaded images to determine their locations using nothing but subtle visual clues.

"Even when I embedded fake GPS coordinates in the image EXIF, the model ignored the spoof and still pinpointed the real locations, showing its performance comes from visual reasoning and on-the-fly web sleuthing -- not hidden metadata," says Patterson. From the post: I notice that it often does a lot of unnecessary and repetitive cropping, and will sometimes spend way too much time on something unimportant. A human is very good at knowing what matters, and o3 is less knowledgeable about what things it should focus on. It got distracted by advertising multiple times. However, most of what it says about things like signs and road lines appears to be accurate, or at least close enough to truth that they meaningfully add up. Given the end result of these excellent guesses, it seems to arrive at the guesses from that information.

If it's using other information to arrive at the guess, then it's not metadata from the files, but instead web search. It seems likely that in the Austria round, the web search was meaningful, since it mentioned the website named the town itself. It appeared less meaningful in the Ireland round. It was still very capable in the rounds without search.

So to put a bow on this:
- The o3 model isn't smoke and mirrors, tricking us by only using EXIF data. It's at a comparable Geoguessr skill level to Master I or better players now (at least according to my own ~20 or so rounds of testing).
- Humans still hold a big edge in decision time -- most of my guesses were 4 min.
- Spoofing EXIF data doesn't throw off the model.

Whether you view this as dystopian or as a technological marvel -- or both -- you can't claim it's a parlor trick.

Mastercard Gives AI Agents Ability To Shop Online for You (financialpost.com) 49

Posted by msmash on Tuesday April 29, 2025 @04:50PM from the brave-new-world dept.

Firefox Finally Delivers Tab Groups Feature (mozilla.org) 47

Posted by msmash on Tuesday April 29, 2025 @04:05PM from the fwiw dept.

AI-Generated Code Creates Major Security Risk Through 'Package Hallucinations' (arstechnica.com) 34

Posted by msmash on Tuesday April 29, 2025 @03:25PM from the side-effects dept.

India Court Orders Proton Mail Block On Security Grounds (livelaw.in) 20

Posted by msmash on Tuesday April 29, 2025 @01:30PM from the escalating-matters dept.

Reddit Issuing 'Formal Legal Demands' Against Researchers Who Conducted Secret AI Experiment on Users 36

Posted by msmash on Tuesday April 29, 2025 @12:01PM from the action-creates-consequences dept.

OpenAI-Microsoft Alliance Fractures as AI Titans Chart Separate Paths (wsj.com) 14

Posted by msmash on Tuesday April 29, 2025 @06:00AM from the closer-look dept.

Duolingo Will Replace Contract Workers With AI 70

Posted by BeauHD on Monday April 28, 2025 @09:00PM from the would-you-look-at-that dept.

OpenAI Upgrades ChatGPT Search With Shopping Features (techcrunch.com) 29

Posted by BeauHD on Monday April 28, 2025 @07:00PM from the watch-out-Google dept.

China's Huawei Develops New AI Chip, Seeking To Match Nvidia (wsj.com) 55

Posted by msmash on Monday April 28, 2025 @01:27PM from the shape-of-things-to-come dept.

Unauthorized AI Bot Experiment Infiltrated Reddit To Test Persuasion Capabilities (404media.co) 82

Posted by msmash on Monday April 28, 2025 @12:43PM from the how-about-that dept.

IBM Pledges $150 Billion US Investment (reuters.com) 42

Posted by msmash on Monday April 28, 2025 @11:23AM from the big-dreams dept.

2008	AVG Fakes User Agent, Floods the Internet	928 comments
2007	Ocarina of Time — Best Game Ever?	615 comments
2006	Nerds Switching from Apple to Ubuntu?	957 comments
2005	Windows Software Ugly, Boring & Uninspired	924 comments
2003	Protecting Cities from Hijacked Planes	971 comments

Slashdot Top Deals