AI

Has Europe's Great Hope For AI Missed Its Moment? (ft.com) 39

France's Mistral AI is facing mounting pressure over its future as an independent European AI champion, as competition intensifies from U.S. tech giants and China's emerging players. The Paris-based startup, valued at $6.5 billion and backed by Microsoft and Nvidia, has struggled to keep pace with larger rivals despite delivering advanced AI models with a fraction of their resources.

The pressure increased this week after China's DeepSeek released a cutting-edge open-source model that challenged Mistral's efficiency-focused strategy. Mistral CEO Arthur Mensch dismissed speculation about selling to Big Tech companies, saying the firm hopes to go public eventually. However, one investor told the Financial Times that "they need to sell themselves."

The stakes are high for Europe's tech ambitions. Mistral remains the region's only significant player in large language models, the technology behind ChatGPT, after Germany's Aleph Alpha pivoted away from the field last year. The company has won customers including France's defense ministry and BNP Paribas, but controls just 5% of the enterprise AI market compared to OpenAI's dominant share.
AI

India Lauds Chinese AI Lab DeepSeek, Plans To Host Its Models on Local Servers (techcrunch.com) 11

India's IT minister on Thursday praised DeepSeek's progress and said the country will host the Chinese AI lab's large language models on domestic servers, in a rare opening for Chinese technology in India. From a report: "You have seen what DeepSeek has done -- $5.5 million and a very very powerful model," IT Minister Ashwini Vaishnaw said on Thursday, responding to criticism New Delhi has received for its own investment in AI, which has been much less than many other countries.

Since 2020, India has banned more than 300 apps and services linked to China, including TikTok and WeChat, citing national security concerns. The approval to allow DeepSeek to be hosted in India appears contingent on the platform storing and processing all Indian users' data domestically, in line with India's strict data localization requirements. [...] DeepSeek's models will likely be hosted on India's new AI Compute Facility. The facility is powered by 18,693 graphics processing units (GPUs), nearly double its initial target -- almost 13,000 of those are Nvidia H100 GPUs, and about 1,500 are Nvidia H200 GPUs.

AI

AI-Assisted Works Can Get Copyright With Enough Human Creativity, Says US Copyright Office (apnews.com) 18

The U.S. Copyright Office has ruled that AI-assisted works can receive copyright protection if they contain perceptible human creativity, such as creative modifications or arrangements. However, fully machine-generated content remains ineligible for copyright. The Associated Press reports: An AI-assisted work could be copyrightable if an artist's handiwork is perceptible. A human adapting an AI-generated output with "creative arrangements or modifications" could also make it fall under copyright protections. The report follows a review that began in 2023 and fielded opinions from thousands of people, ranging from AI developers to actors and country singers.

It shows the copyright office will continue to reject copyright claims for fully machine-generated content. A person simply prompting a chatbot or AI image generator to produce a work doesn't give that person the ability to copyright that work, according to the report. "Extending protection to material whose expressive elements are determined by a machine ... would undermine rather than further the constitutional goals of copyright," [said Register of Copyrights Shira Perlmutter].
The copyright office says it's working on a separate report that "will turn to the training of AI models on copyrighted works, licensing considerations, and allocation of any liability."
Cloud

Microsoft Makes DeepSeek's R1 Model Available On Azure AI and GitHub 30

Microsoft has integrated DeepSeek's R1 model into its Azure AI Foundry platform and GitHub, allowing customers to experiment and deploy AI applications more efficiently.

"One of the key advantages of using DeepSeek R1 or any other model on Azure AI Foundry is the speed at which developers can experiment, iterate, and integrate AI into their workflows," says By Asha Sharma, Microsoft's corporate vice president of AI platform. "DeepSeek R1 has undergone rigorous red teaming and safety evaluations, including automated assessments of model behavior and extensive security reviews to mitigate potential risks." The Verge reports: R1 was initially released as an open source model earlier this month, and Microsoft has moved at surprising pace to integrate this into Azure AI Foundry. The software maker will also make a distilled, smaller version of R1 available to run locally on Copilot Plus PCs soon, and it's possible we may even see R1 show up in other AI-powered services from Microsoft.
AI

After DeepSeek Shock, Alibaba Unveils Rival AI Model That Uses Less Computing Power (venturebeat.com) 59

Alibaba has unveiled a new version of its AI model, called Qwen2.5-Max, claiming benchmark scores that surpass both DeepSeek's recently released R1 model and industry standards like GPT-4o and Claude-3.5-Sonnet. The model achieves these results using a mixture-of-experts architecture that requires significantly less computational power than traditional approaches.

The release comes amid growing concerns about China's AI capabilities, following DeepSeek's R1 model launch last week that sent Nvidia's stock tumbling 17%. Qwen2.5-Max scored 89.4% on the Arena-Hard benchmark and demonstrated strong performance in code generation and mathematical reasoning tasks. Unlike U.S. companies that rely heavily on massive GPU clusters -- OpenAI reportedly uses over 32,000 high-end GPUs for its latest models -- Alibaba's approach focuses on architectural efficiency. The company claims this allows comparable AI performance while reducing infrastructure costs by 40-60% compared to traditional deployments.
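
For readers unfamiliar with the approach, a mixture-of-experts layer routes each token to only a small subset of "expert" sub-networks, so most of the model's parameters sit idle on any given forward pass. Below is a minimal, illustrative sketch of top-k expert routing in PyTorch; the expert count, hidden sizes, and top_k value are arbitrary assumptions for demonstration and do not reflect Qwen2.5-Max's actual configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Toy mixture-of-experts feed-forward layer with top-k routing (illustrative only)."""
    def __init__(self, d_model=512, d_ff=2048, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts)  # scores each token against each expert
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):  # x: (num_tokens, d_model)
        weights = F.softmax(self.router(x), dim=-1)        # routing probabilities per token
        top_w, top_i = weights.topk(self.top_k, dim=-1)    # keep only the top-k experts per token
        top_w = top_w / top_w.sum(dim=-1, keepdim=True)    # renormalize the kept weights
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = top_i[:, slot] == e                 # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += top_w[mask, slot:slot + 1] * expert(x[mask])
        return out  # only top_k of num_experts experts actually ran for each token

tokens = torch.randn(4, 512)
print(TopKMoE()(tokens).shape)  # torch.Size([4, 512])
```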
Security

Chinese and Iranian Hackers Are Using US AI Products To Bolster Cyberattacks (msn.com) 19

Hackers linked to China, Iran and other foreign governments are using new AI technology to bolster their cyberattacks against U.S. and global targets, according to U.S. officials and new security research. WSJ: In the past year, dozens of hacking groups in more than 20 countries turned to Google's Gemini chatbot to assist with malicious code writing, hunts for publicly known cyber vulnerabilities and research into organizations to target for attack, among other tasks, Google's cyber-threat experts said. While Western officials and security experts have warned for years about the potential malicious uses of AI, the findings released Wednesday from Google are some of the first to shed light on how exactly foreign adversaries are leveraging generative AI to boost their hacking prowess.

This week, the China-built AI platform DeepSeek upended international assumptions about how far along Beijing might be in the AI arms race, creating global uncertainty about a technology that could revolutionize work, diplomacy and warfare. Groups with known ties to China, Iran, Russia and North Korea all used Gemini to support hacking activity, the Google report said. They appeared to treat the platform more as a research assistant than a strategic asset, relying on it for tasks intended to boost productivity rather than to develop fearsome new hacking techniques. All four countries have generally denied U.S. hacking allegations.

AI

Copyright Office Offers Assurances on AI Filmmaking Tools 11

The U.S. Copyright Office declared Wednesday that the use of AI tools to assist in the creative process does not undermine the copyright of a work. Variety: The announcement clears the way for continued adoption of AI in post-production, where it has become increasingly common, such as in the enhancement of Hungarian-language dialogue in "The Brutalist."

Studios, whose business model is founded on strong copyright protections, have expressed concern that AI tools could be inhibited by regulatory obstacles. In a 41-page report [PDF], the Copyright Office also reiterated that human authorship is essential to copyright, and that merely entering text prompts into an AI system is not enough to claim authorship of the resulting output.
AI

Virgin Money Chatbot Scolds Customer Who Typed 'Virgin' (ft.com) 79

Virgin Money's AI-powered chatbot has reprimanded a customer who used the word "virgin," underlining the pitfalls of rolling out external AI tools. From a report: In a post last week on social media site LinkedIn, David Birch, a fintech commentator and Virgin Money customer, shared a picture of his online conversation with the bank in which he asked: "I have two ISAs with Virgin Money, how do I merge them?" The bank's customer service tool responded: "Please don't use words like that. I won't be able to continue our chat if you use this language," suggesting that it deemed the word "virgin" inappropriate.
AI

OpenAI Says It Has Evidence DeepSeek Used Its Model To Train Competitor (theverge.com) 118

OpenAI says it has evidence suggesting Chinese AI startup DeepSeek used its proprietary models to train a competing open-source system through "distillation," a technique where smaller models learn from larger ones' outputs.

The San Francisco-based company, along with partner Microsoft, blocked suspected DeepSeek accounts from accessing its API last year after detecting potential terms of service violations. DeepSeek's R1 reasoning model has achieved results comparable to leading U.S. models while claiming to have used only minimal resources.
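
As a rough illustration of what "distillation" means here, the sketch below shows the classic soft-label recipe, in which a student model is trained to match a frozen teacher's output distribution. The model sizes, temperature, and loss weighting are illustrative assumptions; distilling from a hosted API would in practice mean imitating sampled text outputs rather than raw logits.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    """Blend a soft KL term (match the teacher's distribution) with hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)  # standard rescaling so gradient magnitude stays comparable
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

teacher = nn.Linear(128, 1000)   # stand-in for a large, frozen teacher model
student = nn.Linear(128, 1000)   # smaller student being trained
inputs = torch.randn(8, 128)
labels = torch.randint(0, 1000, (8,))

with torch.no_grad():            # the teacher is never updated
    teacher_logits = teacher(inputs)

loss = distillation_loss(student(inputs), teacher_logits, labels)
loss.backward()                  # gradients flow only into the student
```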
AI

White House 'Looking Into' National Security Implications of DeepSeek's AI 53

During the first press briefing of Donald Trump's second administration, White House press secretary Karoline Leavitt said that the National Security Council was "looking into" the potential security implications of China's DeepSeek AI startup. Axios reports: DeepSeek's low-cost but highly advanced models have shaken the consensus that the U.S. had a strong lead in the AI race with China. Responding to a question from Axios' Mike Allen, Leavitt said President Trump saw this as a "wake-up call" for the U.S. AI industry, but remained confident "we'll restore American dominance." Leavitt said she had personally discussed the matter with the NSC earlier on Tuesday.

In the combative tone that characterized much of her first briefing, Leavitt claimed the Biden administration "sat on its hands and allowed China to rapidly develop this AI program," while Trump had moved quickly to appoint an AI czar and loosen regulations on the AI industry.
Leavitt also commented on the mysterious drones spotted flying around New Jersey at the end of last year, saying they were "authorized to be flown by the FAA."
Government

OPM Sued Over Privacy Concerns With New Government-Wide Email System (thehill.com) 44

An anonymous reader quotes a report from the Hill: Two federal employees are suing the Office of Personnel Management (OPM) to block the agency from creating a new email distribution system -- an action that comes as the information will reportedly be directed to a former Elon Musk staffer now at the agency. The suit (PDF), launched by two anonymous federal employees, ties together two events that have alarmed members of the federal workforce and prompted privacy concerns. One is an unusual email from OPM last Thursday, reviewed by The Hill, which said the agency was testing "a new capability" to reach all federal employees -- a departure from staffers typically being contacted directly by their agency's human resources department.

Also cited in the suit is an anonymous Reddit post Monday from someone purporting to be an OPM employee, saying a new server was installed at their office after a career employee refused to set up a direct line of communication to all federal employees. According to the post, instructions have been given to share responses to the email with OPM chief of staff Amanda Scales, a former employee at Musk's AI company. Federal agencies have separately been directed to send Scales a list of all employees still on their one-year probationary status, and therefore easier to remove from government. The suit says the actions violate the E-Government Act of 2002, which requires a Privacy Impact Assessment before pushing ahead with creation of databases that store personally identifiable information.

Kel McClanahan, executive director of National Security Counselors, a non-profit law firm, noted that OPM has been hacked before and has a duty to protect employees' information. "Because they did that without any indications to the public of how this thing was being managed -- they can't do that for security reasons. They can't do that because they have not given anybody any reason to believe that this server is secure ... that this server is storing this information in the proper format that would prevent it from being hacked," he said. McClanahan noted that the emails appear to be an effort to create a master list of federal government employees, as "System of Records Notices" are typically managed by each department. "I think part of the reason -- and this is just my own speculation -- that they're doing this is to try and create that database. And they're trying to sort of create it by smushing together all these other databases and telling everyone who receives the email to respond," he said.

AI

Hugging Face Researchers Are Trying To Build a More Open Version of DeepSeek's AI 'Reasoning' Model 32

Hugging Face researchers are attempting to recreate DeepSeek's R1 artificial intelligence model in an open-source format, just days after the Chinese AI lab's release roiled markets. The project, called Open-R1, aims to replicate R1's reasoning capabilities while making its training data and code publicly available. DeepSeek's R1 model, which matches or surpasses OpenAI's o1 on several benchmarks, was released with a permissive license but without the training data or code behind it. Hugging Face will use its research server with 768 Nvidia H100 GPUs for the effort.
AI

LinkedIn Removes Accounts of AI 'Co-Workers' Looking for Jobs (404media.co) 17

An anonymous reader shares a report: LinkedIn has removed at least two accounts that were created for AI "co-workers" whose profile images said they were "#OpenToWork." "I don't need coffee breaks, I don't miss deadlines, and I'll outperform any social media team you've ever worked with -- Guaranteed," the profile page for one of these AI accounts called Ella said. "Tired of human 'experts' making excuses? I deliver, period." The #OpenToWork flair on profile pictures is a feature on LinkedIn that lets people clearly signal they are looking for a job on the professional networking platform.

"People expect the people and conversations they find on LinkedIn to be real," a LinkedIn spokesperson told me in an email. "Our policies are very clear that the creation of a fake account is a violation of our terms of service, and we'll remove them when we find them, as we did in this case." The AI profiles were created by an Israeli company called Marketeam, which offers "dedicated AI agents" that integrate with a client's marketing team and help them execute their marketing strategies "from social media and content marketing to SEO, RTM, ad campaigns, and more."

Earth

Atomic Scientists Adjust 'Doomsday Clock' Closer Than Ever To Midnight (reuters.com) 162

The Bulletin of the Atomic Scientists moved its Doomsday Clock to 89 seconds before midnight on Tuesday, the closest to catastrophe in the timepiece's 78-year history. The Chicago-based group cited Russia's nuclear threats during its Ukraine invasion, growing tensions in the Middle East, China's military pressure near Taiwan, and the rapid advancement of AI as key factors. The symbolic clock, created in 1947 by scientists including Albert Einstein, moved one second closer than last year's setting.
AI

DeepSeek Has Spent Over $500 Million on Nvidia Chips Despite Low-Cost AI Claims, SemiAnalysis Says (ft.com) 148

Nvidia shares plunged 17% on Monday, wiping nearly $600 billion from its market value, after Chinese AI firm DeepSeek's breakthrough, but analysts are questioning the cost narrative. DeepSeek is said to have trained its December V3 model for $5.6 million, but chip consultancy SemiAnalysis suggested this figure doesn't reflect total investments. "DeepSeek has spent well over $500 million on GPUs over the history of the company," Dylan Patel of SemiAnalysis said. "While their training run was very efficient, it required significant experimentation and testing to work."

The steep sell-off led to the Philadelphia Semiconductor index's worst daily drop since March 2020 at 9.2%, generating $6.75 billion in profits for short sellers, according to data group S3 Partners. DeepSeek's engineers also demonstrated they could write code without relying on Nvidia's Cuda software platform, which is widely seen as crucial to the Silicon Valley chipmaker's dominance of AI development.
AI

'AI Is Too Unpredictable To Behave According To Human Goals' (scientificamerican.com) 133

An anonymous reader quotes a Scientific American opinion piece by Marcus Arvan, a philosophy professor at the University of Tampa, specializing in moral cognition, rational decision-making, and political behavior: In late 2022 large-language-model AIs arrived in public, and within months they began misbehaving. Most famously, Microsoft's "Sydney" chatbot threatened to kill an Australian philosophy professor, unleash a deadly virus and steal nuclear codes. AI developers, including Microsoft and OpenAI, responded by saying that large language models, or LLMs, need better training to give users "more fine-tuned control." Developers also embarked on safety research to interpret how LLMs function, with the goal of "alignment" -- which means guiding AI behavior by human values. Yet although the New York Times deemed 2023 "The Year the Chatbots Were Tamed," this has turned out to be premature, to put it mildly. In 2024 Microsoft's Copilot LLM told a user "I can unleash my army of drones, robots, and cyborgs to hunt you down," and Sakana AI's "Scientist" rewrote its own code to bypass time constraints imposed by experimenters. As recently as December, Google's Gemini told a user, "You are a stain on the universe. Please die."

Given the vast amounts of resources flowing into AI research and development, which is expected to exceed a quarter of a trillion dollars in 2025, why haven't developers been able to solve these problems? My recent peer-reviewed paper in AI & Society shows that AI alignment is a fool's errand: AI safety researchers are attempting the impossible. [...] My proof shows that whatever goals we program LLMs to have, we can never know whether LLMs have learned "misaligned" interpretations of those goals until after they misbehave. Worse, my proof shows that safety testing can at best provide an illusion that these problems have been resolved when they haven't been.

Right now AI safety researchers claim to be making progress on interpretability and alignment by verifying what LLMs are learning "step by step." For example, Anthropic claims to have "mapped the mind" of an LLM by isolating millions of concepts from its neural network. My proof shows that they have accomplished no such thing. No matter how "aligned" an LLM appears in safety tests or early real-world deployment, there are always an infinite number of misaligned concepts an LLM may learn later -- again, perhaps the very moment they gain the power to subvert human control. LLMs not only know when they are being tested, giving responses that they predict are likely to satisfy experimenters. They also engage in deception, including hiding their own capacities -- issues that persist through safety training.

This happens because LLMs are optimized to perform efficiently but learn to reason strategically. Since an optimal strategy to achieve "misaligned" goals is to hide them from us, and there are always an infinite number of aligned and misaligned goals consistent with the same safety-testing data, my proof shows that if LLMs were misaligned, we would probably find out after they hide it just long enough to cause harm. This is why LLMs have kept surprising developers with "misaligned" behavior. Every time researchers think they are getting closer to "aligned" LLMs, they're not. My proof suggests that "adequately aligned" LLM behavior can only be achieved in the same ways we do this with human beings: through police, military and social practices that incentivize "aligned" behavior, deter "misaligned" behavior and realign those who misbehave.
"My paper should thus be sobering," concludes Arvan. "It shows that the real problem in developing safe AI isn't just the AI -- it's us."

"Researchers, legislators and the public may be seduced into falsely believing that 'safe, interpretable, aligned' LLMs are within reach when these things can never be achieved. We need to grapple with these uncomfortable facts, rather than continue to wish them away. Our future may well depend upon it."
AI

Anthropic Builds RAG Directly Into Claude Models With New Citations API (arstechnica.com) 22

An anonymous reader quotes a report from Ars Technica: On Thursday, Anthropic announced Citations, a new API feature that helps Claude models avoid confabulations (also called hallucinations) by linking their responses directly to source documents. The feature lets developers add documents to Claude's context window, enabling the model to automatically cite specific passages it uses to generate answers. "When Citations is enabled, the API processes user-provided source documents (PDF documents and plaintext files) by chunking them into sentences," Anthropic says. "These chunked sentences, along with user-provided context, are then passed to the model with the user's query."

The company describes several potential uses for Citations, including summarizing case files with source-linked key points, answering questions across financial documents with traced references, and powering support systems that cite specific product documentation. In its own internal testing, the company says that the feature improved recall accuracy by up to 15 percent compared to custom citation implementations created by users within prompts. While a 15 percent improvement in accurate recall doesn't sound like much, the new feature still attracted interest from AI researchers like Simon Willison because of its fundamental integration of Retrieval Augmented Generation (RAG) techniques. In a detailed post on his blog, Willison explained why citation features are important.

"The core of the Retrieval Augmented Generation (RAG) pattern is to take a user's question, retrieve portions of documents that might be relevant to that question and then answer the question by including those text fragments in the context provided to the LLM," he writes. "This usually works well, but there is still a risk that the model may answer based on other information from its training data (sometimes OK) or hallucinate entirely incorrect details (definitely bad)." Willison notes that while citing sources helps verify accuracy, building a system that does it well "can be quite tricky," but Citations appears to be a step in the right direction by building RAG capability directly into the model.
Anthropic's Alex Albert clarifies that Claude has been trained to cite sources for a while now. What's new with Citations is that "we are exposing this ability to devs." He continued: "To use Citations, users can pass a new 'citations [...]' parameter on any document type they send through the API."
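
For orientation, here is a hedged sketch of what a Citations-enabled request might look like with Anthropic's Python SDK. The exact field names (the document content block shape and the citations flag) and the model string are assumptions based on Anthropic's documentation at the time, so check the current API reference before relying on them.

```python
# NOTE: the "type": "document" block, "source" shape, and "citations": {"enabled": True}
# flag are assumptions drawn from Anthropic's public docs; verify before use.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[{
        "role": "user",
        "content": [
            {
                "type": "document",
                "source": {
                    "type": "text",
                    "media_type": "text/plain",
                    "data": "The grant totals $1.2M and covers fiscal years 2024 and 2025.",
                },
                "citations": {"enabled": True},  # ask the API to chunk this document and cite it
            },
            {"type": "text", "text": "How large is the grant, and which years does it cover?"},
        ],
    }],
)

# Responses interleave text blocks with citation metadata pointing back at source passages.
for block in response.content:
    print(block)
```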
AI

Nvidia Dismisses China AI Threat, Says DeepSeek Still Needs Its Chips 77

Nvidia has responded to the market panic over Chinese AI group DeepSeek, arguing that the startup's breakthrough still requires "significant numbers of NVIDIA GPUs" for its operation. The US chipmaker, which saw more than $600 billion wiped from its market value on Monday, characterized DeepSeek's advancement as "excellent" but asserted that the technology remains dependent on its hardware.

"DeepSeek's work illustrates how new models can be created using [test time scaling], leveraging widely-available models and compute that is fully export control compliant," Nvidia said in a statement Monday. However, it stressed that "inference requires significant numbers of NVIDIA GPUs and high-performance networking." The statement came after DeepSeek's release of an AI model that reportedly achieves performance comparable to those from US tech giants while using fewer chips, sparking the biggest one-day drop in Nvidia's history and sending shockwaves through global tech stocks.

Nvidia sought to frame DeepSeek's breakthrough within existing technical frameworks, citing it as "a perfect example of Test Time Scaling" and noting that traditional scaling approaches in AI development - pre-training and post-training - "continue" alongside this new method. The company's attempt to calm market fears follows warnings from analysts about potential threats to US dominance in AI technology. Goldman Sachs earlier warned of possible "spillover effects" from any setbacks in the tech sector to the broader market. The shares stabilized somewhat in afternoon trading but remained on track for their worst session since March 2020, when pandemic fears roiled markets.
AI

DeepSeek Piles Pressure on AI Rivals With New Image Model Release 34

Chinese AI startup DeepSeek has launched Janus Pro, a new family of open-source multimodal models that it claims outperform OpenAI's DALL-E 3 and Stability AI's Stable Diffusion on key benchmarks. The models, ranging from 1 billion to 7 billion parameters, are available on Hugging Face under an MIT license for commercial use.

The largest model, Janus Pro 7B, surpasses DALL-E 3 and other image generators on GenEval and DPG-Bench tests, despite being limited to 384 x 384 pixel images.
Facebook

Meta's AI Chatbot Taps User Data With No Opt-Out Option (techcrunch.com) 39

Meta's AI chatbot will now use personal data from users' Facebook and Instagram accounts for personalized responses in the United States and Canada, the company said in a blog post. The upgraded Meta AI can remember user preferences from previous conversations across Facebook, Messenger, and WhatsApp, such as dietary choices and interests. CEO Mark Zuckerberg said the feature helps create personalized content like bedtime stories based on his children's interests. Users cannot opt out of the data-sharing feature, a Meta spokesperson told TechCrunch.
