Science

Thousands of Highly Cited Scientists Have At Least One Retraction (nature.com) 28

More than 8,000 of the world's most-cited scientists have at least one retraction, according to a database that links retractions to top-cited papers. From a report: An analysis of the database, published in PLOS Biology on 30 January, attempts to map the scale of retractions and understand how they manifest. "Not every retraction is a sign of misconduct," says John Ioannidis, an epidemiologist at Stanford University in California, who led the study. "But it is important to have a bird's eye view, across all scientific fields, [of] people who are most influential in science."

Retracted papers had a higher number of self-citations than did non-retracted papers. And papers with higher co-authorship numbers were more likely to be retracted than those with fewer co-authors. [...] In the study, the authors split the most-cited scientists into two groups. The first featured the 217,097 authors who were among the top 2% most-cited in their fields over their careers. The second group comprised the 223,152 scientists who made up the top 2% for citation impact in 2023, the most recent year for which there were data. The authors found that 8,747 (4%) of the most highly cited researchers in 2023 had at least one retraction during their career, as did 7,083 (3.3%) of the researchers who were most-cited over their careers.

Data Storage

Archivists Work To Identify and Save the Thousands of Datasets Disappearing From Data.gov (404media.co) 70

An anonymous reader quotes a report from 404 Media: Datasets aggregated on data.gov, the largest repository of U.S. government open data on the internet, are being deleted, according to the website's own information. Since Donald Trump was inaugurated as president, more than 2,000 datasets have disappeared from the database. As people in the Data Hoarding and archiving communities have pointed out, on January 21, there were 307,854 datasets on data.gov. As of Thursday, there are 305,564 datasets. Many of the deletions happened immediately after Trump was inaugurated, according to snapshots of the website saved on the Internet Archive's Wayback Machine. Harvard University researcher Jack Cushman has been taking snapshots of Data.gov's datasets both before and after the inauguration, and has worked to create a full archive of the data.

"Some of [the entries link to] actual data," Cushman told 404 Media. "And some of them link to a landing page [where the data is hosted]. And the question is -- when things are disappearing, is it the data it points to that is gone? Or is it just the index to it that's gone?" For example, "National Coral Reef Monitoring Program: Water Temperature Data from Subsurface Temperature Recorders (STRs) deployed at coral reef sites in the Hawaiian Archipelago from 2005 to 2019," a NOAA dataset, can no longer be found on data.gov but can be found on one of NOAA's websites by Googling the title. "Stetson Flower Garden Banks Benthic_Covage Monitoring 1993-2018 -- OBIS Event," another NOAA dataset, can no longer be found on data.gov and also appears to have been deleted from the internet. "Three Dimensional Thermal Model of Newberry Volcano, Oregon," a Department of Energy resource, is no longer available via the Department of Energy but can be found backed up on third-party websites. [...]

Data.gov serves as an aggregator of datasets and research across the entire government, meaning it isn't a single database. This makes it slightly harder to archive than any individual database, according to Mark Phillips, a University of Northern Texas researcher who works on the End of Term Web Archive, a project that archives as much as possible from government websites before a new administration takes over. "Some of this falls into the 'We don't know what we don't know,'" Phillips told 404 Media. "It is very challenging to know exactly what, where, how often it changes, and what is new, gone, or going to move. Saving content from an aggregator like data.gov is a bit more challenging for the End of Term work because often the data is only identified and registered as a metadata record with data.gov but the actual data could live on another website, a state .gov, a university website, cloud provider like Amazon or Microsoft or any other location. This makes the crawling even more difficult."

Phillips said that, for this round of archiving (which the team does every administration change), the project has been crawling government websites since January 2024, and that they have been doing "large-scale crawls with help from our partners at the Internet Archive, Common Crawl, and the University of North Texas. We've worked to collect 100s of terabytes of web content, which includes datasets from domains like data.gov." [...] It is absolutely true that the Trump administration is deleting government data and research and is making it harder to access. But determining what is gone, where it went, whether it's been preserved somewhere, and why it was taken down is a process that is time intensive and going to take a while. "One thing that is clear to me about datasets coming down from data.gov is that when we rely on one place for collecting, hosting, and making available these datasets, we will always have an issue with data disappearing," Phillips said. "Historically the federal government would distribute information to libraries across the country to provide greater access and also a safeguard against loss. That isn't done in the same way for this government data."

Books

Books Written By Humans Are Getting Their Own Certification (theverge.com) 76

The Authors Guild -- one of the largest associations of writers in the US -- has launched a new project that allows authors to certify that their book was written by a human, and not generated by artificial intelligence. From a report: The Guild says its "Human Authored" certification aims to make it easier for writers to "distinguish their work in increasingly AI-saturated markets," and that readers have a right to know who (or what) created the books they read. Human Authored certifications will be listed in a public database that anyone can access.
Government

OPM Sued Over Privacy Concerns With New Government-Wide Email System (thehill.com) 44

An anonymous reader quotes a report from the Hill: Two federal employees are suing the Office of Personnel Management (OPM) to block the agency from creating a new email distribution system -- an action that comes as the information will reportedly be directed to a former staffer to Elon Musk now at the agency. The suit (PDF), launched by two anonymous federal employees, ties together two events that have alarmed members of the federal workforce and prompted privacy concerns. That includes an unusual email from OPM last Thursday reviewed by The Hill said the agency was testing "a new capability" to reach all federal employees -- a departure from staffers typically being contacted directly by their agency's human resources department.

Also cited in the suit is an anonymous Reddit post Monday from someone purporting to be an OPM employee, saying a new server was installed at their office after a career employee refused to set up a direct line of communication to all federal employees. According to the post, instructions have been given to share responses to the email to OPM chief of staff Amanda Scales, a former employee at Musk's AI company. Federal agencies have separately been directed to send Scales a list of all employees still on their one-year probationary status, and therefore easier to remove from government. The suit says the actions violate the E-Government Act of 2002, which requires a Privacy Impact Assessment before pushing ahead with creation of databases that store personally identifiable information.

Kel McClanahan, executive director of National Security Counselors, a non-profit law firm, noted that OPM has been hacked before and has a duty to protect employees' information. "Because they did that without any indications to the public of how this thing was being managed -- they can't do that for security reasons. They can't do that because they have not given anybody any reason to believe that this server is secure.that this server is storing this information in the proper format that would prevent it from being hacked," he said. McClanahan noted that the emails appear to be an effort to create a master list of federal government employees, as "System of Records Notices" are typically managed by each department. "I think part of the reason -- and this is just my own speculation -- that they're doing this is to try and create that database. And they're trying to sort of create it by smushing together all these other databases and telling everyone who receives the email to respond," he said.

Microsoft

Microsoft Takes on MongoDB with PostgreSQL-Based Document Database (theregister.com) 23

Microsoft has launched an open-source document database platform built on PostgreSQL, partnering with FerretDB as a front-end interface. The solution includes two PostgreSQL extensions: pg_documentdb_core for BSON optimization and pg_documentdb_api for data operations.

FerretDB CEO Peter Farkas said the integration with Microsoft's DocumentDB extension has improved performance twentyfold for certain workloads in FerretDB 2.0. The platform carries no commercial licensing fees or usage restrictions under its MIT license, according to Microsoft.
Privacy

Federal Court Rules Backdoor Searches of 702 Data Unconstitutional (eff.org) 42

A federal district court has ruled that backdoor searches of Americans' private communications collected under Section 702 of FISA are unconstitutional without a warrant. "The landmark ruling comes in a criminal case, United States v. Hasbajrami, after more than a decade of litigation, and over four years since the Second Circuit Court of Appeals found that backdoor searches constitute 'separate Fourth Amendment events' and directed the district court to determine a warrant was required," reports the Electronic Frontier Foundation (EFF). "Now, that has been officially decreed." Longtime Slashdot reader schwit1 shares the report: Hasbajrami involves a U.S. resident who was arrested at New York JFK airport in 2011 on his way to Pakistan and charged with providing material support to terrorists. Only after his original conviction did the government explain that its case was premised in part on emails between Mr. Hasbajrami and an unnamed foreigner associated with terrorist groups, emails collected warrantless using Section 702 programs, placed in a database, then searched, again without a warrant, using terms related to Mr. Hasbajrami himself.

The district court found that regardless of whether the government can lawfully warrantlessly collect communications between foreigners and Americans using Section 702, it cannot ordinarily rely on a "foreign intelligence exception" to the Fourth Amendment's warrant clause when searching these communications, as is the FBI's routine practice. And, even if such an exception did apply, the court found that the intrusion on privacy caused by reading our most sensitive communications rendered these searches "unreasonable" under the meaning of the Fourth Amendment. In 2021 alone, the FBI conducted 3.4 million warrantless searches of US person's 702 data.

Businesses

IMDb Founder Steps Down As CEO After 35 Years (techcrunch.com) 24

IMDb founder Col Needham is stepping down as CEO after 35 years, transitioning to executive chair. He will be succeeded by Nikki Santoro, who has served as the chief operating officer since 2021. TechCrunch reports: Santoro's appointment is significant, as she is the first woman to become the CEO and only the second person to hold the position. Needham founded IMDb in 1990 at the age of 23, steering the company into a powerhouse within the entertainment industry. After 35 years, he'll transition to a new role as executive chair.

According to Needham, Santoro's ascension is well deserved. [...] Santoro has been with the company since 2016, leading the company in expanding its database and improving its IMDbPro membership. She previously held leadership positions at Amazon, Microsoft, and The Weather Channel.
"Nikki's strategic vision, deep understanding of our customers and products, and commitment to innovation have already delivered impressive business results during her tenure as COO," said Needham in a statement. "Her track record of driving growth and enhancing our products and services makes her the ideal person to guide IMDb into a new era."
Transportation

Aptera's Solar-Powered Electric Car Shown at CES, Finally Nears Production (motortrend.com) 122

"Engineers have showcased a prototype electric vehicle that can drive for up to 40 miles (64 kilometers) per day using just solar power," reports LiveScience. The production-ready "Aptera Launch Edition" made its first appearance this month at CES 2025, and "also offers up to 400 miles (640 km) of range from a single charge via an electrical output, company representatives said in a statement."

LiveScience describes the vehicle as "lighter and more energy-efficient than conventional EVs, while offering a 50% reduction in aerodynamic resistance," with an energy efficiency rating of 100 Watt-hours per mile (Wh/mile). By contrast, a Tesla Model S (released in 2022) consumes 194 Wh/mile in the city in mild weather and 288 Wh/mile on the highway in mild weather, according to the EV Database. At a maximum range of 440 miles — including 40 miles using solar power and 400 miles using electricity — the Aptera EV may also overtake the current longest-range vehicles in production. The Mercedes-Benz EQS 450+ has a maximum range of 425 miles (684 km), according to the EV Database, followed by the Lucid Air Grand Touring at 410 miles (660 km).
Aptera says it's raised $135 million "through equity crowdfunding" to fund its pre-production progress. "Since its launch, the Company has accepted $1.7 billion in pre-orders with nearly 50,000 vehicles reserved by future Aptera owners in the U.S. and internationally."

MotorTrend writes that "nearly two decades in the making, the otherworldly three-wheel Aptera is headed to production this year as a $40,000, 400-mile EV that can capture up to 40 miles worth of free solar energy every day. Maybe." The California startup made similar promises in 2008, 2009, 2012, and 2022 and yet it has never delivered a single vehicle. Is anything different this time...?

At CES, co-CEO (and one of Aptera's original founders) Chris Anthony told MotorTrend it will take another $60 million to finish the development work, buy the tooling, and build out the Carlsbad, California, assembly plant. "We're still in fundraising mode and we hope that we inspire some people in this beautiful building (Las Vegas Convention Center) to invest in Aptera," Anthony said. "We're trying to raise $20 million in the first quarter of this year. That will basically kick off all the long-lead items to get into production, but it's a $60 million plan to get into volume production." Anthony said the company has already made one of its largest purchases, the molds for the carbon-fiber sheet-molding composite body structure and the fiberglass sheet-molding composite body panels that will be made in Italy. The next $20 million will cover the tooling for the diecast metal suspension arms and the injection-molded interior components...

It would be relatively easy for Aptera to hand build cars in a garage and announce the start of production, but the plan calls for building up to 80 cars per day per the guidance of engineering consultant and YouTuber Sandy Munro, who is an Aptera investor and adviser. "He really helped shepherd the design from what was an early prototype prove-out design into how to make the most manufacturable vehicle ever," Anthony said. The structure is built from just six parts and the entire car has been designed to be put together in a factory with just 12 stations. But that radical simplicity complicates the job at hand right now. In addition to developing the car, the small engineering team also has to create the machine that makes it. Anthony's plan has the factory ramping up to build 20,000 vehicles a year within nine months of starting production at the end of 2025.

Before that can happen, Aptera needs to clear the same hurdle that tripped it up in 2011 and sent the company stumbling into liquidation — the money. "We would love one investor to be so inspired by what we're doing that they just hand us a $60 million check," Anthony told MotorTrend. "But it could be something that's kind of piecemeal over the next nine months to get that $60 million into the company." Are you convinced?

AI

World's First AI Chatbot, ELIZA, Resurrected After 60 Years (livescience.com) 37

"Scientists have just resurrected 'ELIZA,' the world's first chatbot, from long-lost computer code," reports LiveScience, "and it still works extremely well." (Click in the vintage black-and-green rectangle for a blinking-cursor prompt...) Using dusty printouts from MIT archives, these "software archaeologists" discovered defunct code that had been lost for 60 years and brought it back to life. ELIZA was developed in the 1960s by MIT professor Joseph Weizenbaum and named for Eliza Doolittle, the protagonist of the play "Pygmalion," who was taught how to speak like an aristocratic British woman.

As a language model that the user could interact with, ELIZA had a significant impact on today's artificial intelligence (AI), the researchers wrote in a paper posted to the preprint database arXiv Sunday (Jan. 12). The "DOCTOR" script written for ELIZA was programmed to respond to questions as a psychotherapist would. For example, ELIZA would say, "Please tell me your problem." If the user input "Men are all alike," the program would respond, "In what way."

Weizenbaum wrote ELIZA in a now-defunct programming language he invented, called Michigan Algorithm Decoder Symmetric List Processor (MAD-SLIP), but it was almost immediately copied into the language Lisp. With the advent of the early internet, the Lisp version of ELIZA went viral, and the original version became obsolete. Experts thought the original 420-line ELIZA code was lost until 2021, when study co-author Jeff Shrager, a cognitive scientist at Stanford University, and Myles Crowley, an MIT archivist, found it among Weizenbaum's papers. "I have a particular interest in how early AI pioneers thought," Shrager told Live Science in an email. "Having computer scientists' code is as close to having a record of their thoughts, and as ELIZA was — and remains, for better or for worse — a touchstone of early AI, I want to know what was in his mind...."

Even though it was intended to be a research platform for human-computer communication, "ELIZA was such a novelty at the time that its 'chatbotness' overwhelmed its research purposes," Shrager said.

I just remember that time 23 years ago when someone connected a Perl version of ELIZA to "an AOL Instant Messenger account that has a high rate of 'random' people trying to start conversations" to "put ELIZA in touch with the real world..."

Thanks to long-time Slashdot reader MattSparkes for sharing the news.
Privacy

PowerSchool Data Breach Victims Say Hackers Stole 'All' Historical Student and Teacher Data (techcrunch.com) 21

An anonymous reader shares a report: U.S. school districts affected by the recent cyberattack on edtech giant PowerSchool have told TechCrunch that hackers accessed "all" of their historical student and teacher data stored in their student information systems. PowerSchool, whose school records software is used to support more than 50 million students across the United States, was hit by an intrusion in December that compromised the company's customer support portal with stolen credentials, allowing access to reams of personal data belonging to students and teachers in K-12 schools.

The attack has not yet been publicly attributed to a specific hacker or group. PowerSchool hasn't said how many of its school customers are affected. However, two sources at affected school districts -- who asked not to be named -- told TechCrunch that the hackers accessed troves of personal data belonging to both current and former students and teachers.
Further reading: Lawsuit Accuses PowerSchool of Selling Student Data To 3rd Parties.
AI

OpenAI's Bot Crushes Seven-Person Company's Website 'Like a DDoS Attack' 78

An anonymous reader quotes a report from TechCrunch: On Saturday, Triplegangers CEO Oleksandr Tomchuk was alerted that his company's e-commerce site was down. It looked to be some kind of distributed denial-of-service attack. He soon discovered the culprit was a bot from OpenAI that was relentlessly attempting to scrape his entire, enormous site. "We have over 65,000 products, each product has a page," Tomchuk told TechCrunch. "Each page has at least three photos." OpenAI was sending "tens of thousands" of server requests trying to download all of it, hundreds of thousands of photos, along with their detailed descriptions. "OpenAI used 600 IPs to scrape data, and we are still analyzing logs from last week, perhaps it's way more," he said of the IP addresses the bot used to attempt to consume his site. "Their crawlers were crushing our site," he said "It was basically a DDoS attack."

Triplegangers' website is its business. The seven-employee company has spent over a decade assembling what it calls the largest database of "human digital doubles" on the web, meaning 3D image files scanned from actual human models. It sells the 3D object files, as well as photos -- everything from hands to hair, skin, and full bodies -- to 3D artists, video game makers, anyone who needs to digitally recreate authentic human characteristics. [...] To add insult to injury, not only was Triplegangers knocked offline by OpenAI's bot during U.S. business hours, but Tomchuk expects a jacked-up AWS bill thanks to all of the CPU and downloading activity from the bot.
Triplegangers initially lacked a properly configured robots.txt file, which allowed the bot to freely scrape its site since the system interprets the absence of such a file as permission. It's not an opt-in system.

Once the file was updated with specific tags to block OpenAI's bot, along with additional defenses like Cloudflare, the scraping stopped. However, robots.txt is not foolproof since compliance by AI companies is voluntary, leaving the burden on website owners to monitor and block unauthorized access proactively. "[Tomchuk] wants other small online business to know that the only way to discover if an AI bot is taking a website's copyrighted belongings is to actively look," reports TechCrunch.
Privacy

Database Tables of Student, Teacher Info Stolen From PowerSchool In Cyberattack (theregister.com) 18

An anonymous reader quotes a report from The Register: A leading education software maker has admitted its IT environment was compromised in a cyberattack, with students and teachers' personal data -- including some Social Security Numbers and medical info -- stolen. PowerSchool says its cloud-based student information system is used by 18,000 customers around the globe, including the US and Canada, to handle grading, attendance records, and personal information of more than 60 million K-12 students and teachers. On December 28 someone managed to get into its systems and access their contents "using a compromised credential," the California-based biz told its clients in an email seen by Register this week.

[...] "We believe the unauthorized actor extracted two tables within the student information system database," a spokesperson told us. "These tables primarily include contact information with data elements such as name and address information for families and educators. "For a certain subset of the customers, these tables may also include Social Security Number, other personally identifiable information, and limited medical and grade information. "Not all PowerSchool student information system customers were impacted, and we anticipate that only a subset of impacted customers will have notification obligations."
While the company has tightened security measures and offered identity protection services to affected individuals, cybersecurity firm Cyble suggests the intrusion "may have been more serious and gone on much longer than has been publicly acknowledged so far," reports The Register. The cybersecurity vendor says the intrusion could have occurred as far back as June 16, 2011, with it ending on January 2 of this year.

"Critical systems and applications such as Oracle Netsuite ERP, HR software UltiPro, Zoom, Slack, Jira, GitLab, and sensitive credentials for platforms like Microsoft login, LogMeIn, Windows AD Azure, and BeyondTrust" may have been compromised, too.
Piracy

Science Paper Piracy Site Sci-Hub Shares Lots of Retracted Papers (arstechnica.com) 48

The shift from paywalled to open-access scientific publishing is progressing, driven in part by platforms like Sci-Hub -- a website that allows users to upload PDFs of published papers and share them with anyone. While the shadow library website has faced ongoing attempts by publishers to block access, it has another problem: the platform features many outdated or retracted papers that could spread misinformation or flawed findings. Ars Technica reports: Sci-Hub works a bit like a combination of cache and aggregator for published materials. Whenever it gets a request for a paper that's not already in its database, it uses leaked login credentials to go to the website of whatever journal published the paper and obtain a copy. If it already has a copy, however, it will simply serve that up instead. This leaves open the possibility that it will have obtained a copy of a paper prior to its retraction and continue to distribute that copy after the paper has been retracted.

To check this, the researchers obtained a list of nearly 17,000 retracted papers and searched for them on Sci-Hub. They then visually examined the documents that were returned. They found that 85 percent of them contained no indication that the paper had been retracted. "The availability of [unlabeled retracted articles] in the field of health sciences is particularly high," they note, "which indicates a significant risk of their unintended use and further citation in future research."

While corrections are less severe than retractions, they're likely to suffer a similar problem. And corrections will often involve the technical details of a paper -- the experimental approaches or raw data that will be critical for anyone wanting to replicate or extend previously published results. So, if anything, their impact will be more significant.
Ars notes that a system called Crossmark is available to help find the most up-to-date version of a paper, including any corrections or retraction notices.
Open Source

New York Times Recognizes Open-Source Maintainers With 2024 'Good Tech' Award (thestar.com.my) 7

This week New York Times technology columnist Kevin Roose published his annual "Good Tech" awards to "shine the spotlight on a few tech projects that I think contributed positively to humanity."

And high on the list is "Andres Freund, and every open-source software maintainer saving us from doom." The most fun column I wrote this past year was about a Microsoft database engineer, Andres Freund, who got some odd errors while doing routine maintenance on an obscure open-source software package called xz Utils. While investigating, Freund inadvertently discovered a huge security vulnerability in the Linux operating system, which could have allowed a hacker to take control of hundreds of millions of computers and bring the world to its knees.

It turns out that much of our digital infrastructure rests on similar acts of nerdy heroism. After writing about Freund's discovery, I received tips about other near disasters involving open-source software projects, many of which were averted by sharp-eyed volunteers catching bugs and fixing critical code just in time to foil the bad guys. I could not write about them all, but this award is to say: I see you, open-source maintainers, and I thank you for your service.

Roose also acknowledges the NASA engineers who kept Voyager 1 transmitting back to earth from interstellar space — and Bluesky, "for making my social media feeds interesting again."

Roose also notes it was a big year for AI. There's a shout-out to Epoch AI, a small nonprofit research group in Spain, "for giving us reliable data on the AI boom." ("The firm maintains public databases of AI models and AI hardware, and publishes research on AI trends, including an influential report last year about whether AI models can continue to grow at their current pace. Epoch AI concluded they most likely could until 2030.") And there's also a shout-out to groups "pushing AI forward" and positive uses "to improve health care, identify new drugs and treatments for debilitating diseases and accelerate important scientific research."
  • The nonprofit Arc Institute released Evo, an AI model that "can predict and generate genomic sequences, using technology similar to the kind that allows systems like ChatGPT to predict the next words in a sequence."
  • A Harvard University lab led by Dr. Jeffrey Lichtman teamed with researchers from Google for "the most detailed map of a human brain sample ever created. The team used AI to map more than 150 million synapses in a tiny sample of brain tissue at nanometer-level resolution..."
  • Researchers at Stanford and McMaster universities developed SyntheMol, "a generative AI model that can design new antibiotics from scratch."

AI

How AI-Based Military Intelligence Powered Israel's Attacks on Gaza (msn.com) 131

It's "what some experts consider the most advanced military AI initiative ever to be deployed," reports the Washington Post.

But the Israeli military's AI-powered intelligence practices are also "under scrutiny. Genocide charges against Israel brought to The Hague by South Africa question whether crucial decisions about bombing targets in Gaza were made by software, an investigation that could hasten a global debate about the role of AI technology in warfare." After the brutal Oct. 7, 2023, attack by Hamas, the Israel Defense Forces deluged Gaza with bombs, drawing on a database painstakingly compiled through the years that detailed home addresses, tunnels and other infrastructure critical to the militant group. But then the target bank ran low. To maintain the war's breakneck pace, the IDF turned to an elaborate artificial intelligence tool called Habsora — or "the Gospel" — which could quickly generate hundreds of additional targets. The use of AI to rapidly refill IDF's target bank allowed the military to continue its campaign uninterrupted, according to two people familiar with the operation. It is an example of how the decade-long program to place advanced AI tools at the center of IDF's intelligence operations has contributed to the violence of Israel's 14-month war in Gaza... People familiar with the IDF's practices, including soldiers who have served in the war, say Israel's military has significantly expanded the number of acceptable civilian casualties from historic norms. Some argue this shift is enabled by automation, which has made it easier to speedily generate large quantities of targets, including of low-level militants who participated in the Oct. 7 attacks.
In a statement to The Post, the IDF argued that "If anything, these tools have minimized collateral damage and raised the accuracy of the human-led process." The IDF requires an officer to sign off on any recommendations from its "big data processing" systems, according to an intelligence official who spoke on the condition of anonymity because Israel does not release division leaders' names. The Gospel and other AI tools do not make decisions autonomously, the person added...Recommendations that survive vetting by an intelligence analyst are placed in the target bank by a senior officer...

Another machine learning tool, called Lavender, uses a percentage score to predict how likely a Palestinian is to be a member of a militant group, allowing the IDF to quickly generate a large volume of potential human targets... The rule mandating two pieces of human-derived intelligence to validate a prediction from Lavender was dropped to one at the outset of the war, according to two people familiar with the efforts. In some cases in the Gaza division, soldiers who were poorly trained in using the technology attacked human targets without corroborating Lavender's predictions at all, the soldier said.

The article includes an ominous quote from Steven Feldstein, a senior fellow at the Carnegie Endowment who researches the use of AI in war. Feldstein acknowledges questions of accuracy, but also notes the accelerated speed of the systems, and the ultimate higher death count. His conclusion?

"What's happening in Gaza is a forerunner of a broader shift in how war is being fought."
Privacy

Cloudflare's VPN App Among Half-Dozen Pulled From Indian App Stores (techcrunch.com) 12

More than half-a-dozen VPN apps, including Cloudflare's widely-used 1.1.1.1, have been pulled from India's Apple App Store and Google Play Store following intervention from government authorities, TechCrunch reported Friday. From the report: The Indian Ministry of Home Affairs issued removal orders for the apps, according to a document reviewed by TechCrunch and a disclosure made by Google to Lumen, Harvard University's database that tracks government takedown requests globally.
The Military

Missiles Are Now the Biggest Killer of Airline Passengers (wsj.com) 79

Accidental missile attacks on commercial airliners have become the leading cause of aviation fatalities in recent years (Warning: source paywalled; alternative source), driven by rising global conflicts and the proliferation of advanced antiaircraft weaponry. Despite improvements in aviation safety overall, inconsistent risk assessments, political complexities, and rapid military escalations make protecting civilian flights in conflict zones increasingly difficult. The Wall Street Journal reports: The crash Wednesday of an Azerbaijan Airlines jetliner in Kazakhstan, if officially confirmed as a midair attack, would be the third major fatal downing of a passenger jet linked to armed conflict since 2014, according to the Flight Safety Foundation's Aviation Safety Network, a global database of accidents and incidents. The tally would bring to more than 500 the number of deaths from such attacks during that period. Preliminary results of Azerbaijan's investigation into the crash indicate the plane was hit by a Russian antiaircraft missile, or shrapnel from it, said people briefed on the probe.

"It adds to the worrying catalog of shootdowns now," said Andy Blackwell, an aviation risk adviser at security specialist ISARR and former head of security at Virgin Atlantic. "You've got the conventional threats, from terrorists and terrorist groups, but now you've got this accidental risk as well." No other cause of aviation fatalities on commercial airliners comes close to shootdowns over those years, according to ASN data. The deadliness of such attacks is a dramatic shift: In the preceding 10 years, there were no fatal shootdowns of scheduled commercial passenger flights, ASN data show. The trend highlights the difficulty -- if not impossibility -- of protecting civilian aviation in war zones, even for rigorous aviation regulators, because of the politics of war. Early last century similar woes plagued sea travel, when belligerents targeted ocean transport.

Increasing civilian aviation deaths from war also reflect both a growing number of armed conflicts internationally and the increasing prevalence of powerful antiaircraft weaponry. If a missile was indeed the cause of this week's disaster, it would mean that the three deadliest shootdowns of the past decade all involved apparently unintended targetings of passenger planes flying near conflict zones, by forces that had been primed to hit enemy military aircraft. Two of those incidents were linked to Russia: Wednesday's crash of an Embraer E190 with 67 people aboard, of whom 38 died, and the midair destruction in 2014 of a Malaysia Airlines Boeing 777 flying over a battle zone in Ukraine, on which all 298 people aboard died. The other major downing was the mistaken shooting in 2020 by Iranian forces of a Ukraine International Airlines Boeing 737 departing Tehran, killing all 176 people onboard. Iran's missile defense systems had been on alert for a potential U.S. strike at the time.

Books

Encyclopedia Britannica Is Now an AI Company 59

An anonymous reader quotes a report from Gizmodo: Once an icon of the 20th century seen as obsolete in the 21st, Encyclopedia Britannica -- now known as just Britannica -- is all in on artificial intelligence, and may soon go public at a valuation of nearly $1 billion, according to the New York Times.

Until 2012 when printing ended, the company's books served as the oldest continuously published, English-language encyclopedias in the world, essentially collecting all the world's knowledge in one place before Google or Wikipedia were a thing. That has helped Britannica pivot into the AI age, where models benefit from access to high-quality, vetted information. More general-purpose models like ChatGPT suffer from hallucinations because they have hoovered up the entire internet, including all the junk and misinformation.

While it still offers an online edition of its encyclopedia, as well as the Merriam-Webster dictionary, Britannica's biggest business today is selling online education software to schools and libraries, the software it hopes to supercharge with AI. That could mean using AI to customize learning plans for individual students. The idea is that students will enjoy learning more when software can help them understand the gaps in their understanding of a topic and stay on it longer. Another education tech company, Brainly, recently announced that answers from its chatbot will link to the exact learning materials (i.e. textbooks) they reference.

Britannica's CEO Jorge Cauz also told the Times about the company's Britannica AI chatbot, which allows users to ask questions about its vast database of encyclopedic knowledge that it collected over two centuries from vetted academics and editors. The company similarly offers chatbot software for customer service use cases. Britannica told the Times it is expecting revenue to double from two years ago, to $100 million.
DRM

Takedown Notices Hit Luigi Mangione Merchandise and Photos - Including DMCAs (404media.co) 100

Newsweek supplies some context After his arrest, merch — including T-shirts featuring Mangione's booking photos and others taken from his social media accounts — began popping up for sale on several sites. Websites, including Amazon, eBay and Etsy, have moved to take down products that glorify violence or the suspect. An eBay spokesperson told Newsweek that "items that glorify or incite violence, including those that celebrate the recent murder of UHC CEO Brian Thompson, are prohibited."
Inc. magazine adds: Separately, GoFundMe has shuttered several fundraising campaigns created for Mangione. The fundraising site's terms and conditions are pretty clear on the matter, NBC News reports, with a company spokesperson explaining they prohibit "fundraisers for the legal defense of violent crimes."
But one incident was different, according to a post from the law school of the University of British Columbia: To provide a quick summary, Rachel Kenaston, an artist selling merch on TeePublic received an e-mail from the platform regarding intellectual property claim by UnitedHealth Group Inc and decided to remove Kenaston's design from the merch store. Obviously, it is important to point out that it isn't quite clear who is filing those DMCA claims. While TeePublic, in the email, claimed that they have no say in the matter, [an article from 404 Media] goes on to explain that TeePublic has the right to refuse DMCA claims, but often choose not to in order to avoid headache. The design had nothing to do with UnitedHealthcare-it seems to be a picture of the Mangione in a heart frame. Meaning, whether it was UnitedHealthcare or not, the claim shouldn't hold any weight.

Consensus seems to be mostly leaning towards speculation that it is unlikely to be UnitedHealthcare actually filing those DMCA claims, but rather potential competitors... Regardless of whether or not it really was UnitedHealthcare that filed DMCA claims, I think the important point here is that the merch actually did get taken down. In fact, this would be more problematic if it was from a competitor using DMCA as a form of removing competition, because, then it really has nothing to do with intellectual property. I would assume that this happens quite frequently. Especially for YouTubers, it seems that copyright strikes are more than a mere pesky occurrence, but for many, something that affects livelihood...

The difficult part, as always, is finding the balance between protecting the rights of the copyright holders and ensuring that the mechanisms doesn't get abused.

The artist told Gizmodo she was filing a counterclaim to the copyright notice, adding that instead of a DMCA, "I honestly expected the design to be pulled for condoning violence or something..."

Gizmodo published the image — a watercolored rendition of a hostel surveillance-camera photo released by police — adding "UnitedHealth Group didn't respond to questions emailed on Monday [December 16] about how the company could possibly claim a copyright violation had occurred." And while Gizmodo promised they'd update the post if UnitedHealth responded — there has been no update since...

404 Media adds that the watercolor "is not the only United Healthcare or Luigi Mangione-themed artwork on the internet that has been hit with bogus DMCA takedowns in recent days. Several platforms publish the DMCA takedown requests they get on the Lumen Database, which is a repository of DMCA takedowns." On December 7, someone named Samantha Montoya filed a DMCA takedown with Google that targeted eight websites selling "Deny, Defend, Depose" merch that uses elements of the United Healthcare logo... Medium, one of the targeted websites, has deleted the page that the merch was hosted on...

Over the weekend, a lawyer demanded that independent journalist Marisa Kabas take down an image of Luigi Mangione and his family that she posted to Bluesky, which was originally posted on the campaign website of Maryland assemblymember Nino Mangione. The lawyer, Desiree Moore, said she was "acting on behalf of our client, the Doe Family," and claimed that "the use of this photograph is not authorized by the copyright owner and is not otherwise permitted by law..." In a follow-up email to Kabas, Moore said "the owner of the photograph has not authorized anyone to publish, disseminate, or otherwise use the photograph for any purpose, and the photograph has been removed from various digital platforms as a result," which suggests that other websites have also been threatened with takedown requests. Moore also said that her "client seeks to remain anonymous" and that "the photograph is hardly newsworthy."

404 Media believes the takedown request "shows that the Mangione family or someone associated with it is using the prospect of a copyright lawsuit to threaten journalists for reporting on one of the most important stories of the year..."

UPDATE: Long-time Slashdot reader destinyland notes there's an interesting precedent from 2007: [D]eep within the DMCA law is a counter-provision — 512(f), which states that misrepresenting yourself as a copyright owner has consequences. Any damage caused by harmful misrepresentation must be reimbursed. In 2004 the Electronic Frontier Foundation won a six-figure award from Diebold Election Systems, who had claimed a "copyright" on embarrassing internal memos which were published online.
The Internet

Months After Its 20th Anniversary, OpenStreetMap Suffers an Extended Outage (openstreetmap.org) 10

Monday long-time Slashdot reader denelson83 wrote: The crowdsourced, widely-used map database OpenStreetMap has had a hardware failure at its upstream ISP in Amsterdam and has been put into a protective read-only mode to avoid loss or corruption of data. .
The outage had started Sunday December 15 at 4:00AM (GMT/UTC), but by Tuesday they'd posted a final update: Our new ISP is up and running and we have started migrating our servers across to it. If all goes smoothly we hope to have all services back up and running this evening...

We have dual redundant links via separate physical hardware from our side to our Tier 1 ISP. We unexpectedly discovered their equipment is a single point failure. Their extended outage is an extreme disappointment to us.

We are an extremely small team. The OSMF budget is tiny and we could definitely use more help. Real world experience... Ironically we signed a contract with a new ISP in the last few days. Install is on-going (fibre runs, modules & patching) and we expect to run old and new side-by-side for 6 months. Significantly better resilience (redundant ISP side equipment, VRRP both ways, multiple upstream peers... 2x diverse 10G fibre links).

OpenStreetMap celebrated its 20th anniversary in August, with a TechCrunch profile reminding readers the site gives developers "geographic data and maps so they can rely a little less on the proprietary incumbents in the space," reports TechCrunch, adding "Yes, that mostly means Google."

OpenStreetMap starts with "publicly available and donated aerial imagery and maps, sourced from governments and private organizations such as Microsoft" — then makes them better: Today, OpenStreetMap claims more than 10 million contributors who map out and fine-tune everything from streets and buildings, to rivers, canyons and everything else that constitutes our built and natural environments... Contributors can manually add and edit data through OpenStreetMap's editing tools, and they can even venture out into the wild and map a whole new area by themselves using GPS, which is useful if a new street crops up, for example...

OpenStreetMap's Open Database License allows any third-party to use its data with the appropriate attribution (though this attribution doesn't always happen). This includes big-name corporations such as Apple and VC-backed unicorns like MapBox, through a who's who of tech companies, including Uber and Strava... More recently, the Overture Maps Foundation — an initiative backed by Microsoft, Amazon, Meta and TomTom — has leaned heavily on OpenStreetMap data as part of its own efforts to build a viable alternative to Google's walled mapping garden.

The article notes that OpenStreetMap is now overseen by the U.K.-based nonprofit OpenStreetMap Foundation (supported mainly by donations and memberships), with just one employee — a system engineer — "and a handful of contractors who provide administrative and accounting support."

In August its original founder Steve Coast, returned to the site for a special blog post on its 20th anniversary: OpenStreetMap has grown exponentially or quadratically over the last twenty years depending on the metric you're interested in... The story isn't so much about the data and technology, and it never was. It's the people... OpenStreetMap managed to map the world and give the data away for free for almost no money at all. It managed to sidestep almost all the problems that Wikipedia has by virtue of only representing facts not opinions. The project itself is remarkable. And it's wonderful that so many are in love with it.
"Two decades ago, I knew that a wiki map of the world would work," Coast writes. "It seemed obvious in light of the success of Wikipedia and Linux..."

Slashdot Top Deals