Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Category Archives: Intellectual Property

There’s No Longer Any Doubt That Hollywood Writing Is Powering AI

The Atlantic – Dialogue from these movies and TV shows has been used by companies such as Apple and Anthropic to train AI systems [unpaywalled] By Alex Reisner – “I can now say with absolute confidence that many AI systems have been trained on TV and film writers’ work. Not just on The Godfather and Alf, but on more than 53,000 other movies and 85,000 other TV episodes: Dialogue from all of it is included in an AI-training data set that has been used by Apple, Anthropic, Meta, Nvidia, Salesforce, Bloomberg, and other companies. I recently downloaded this data set, which I saw referenced in papers about the development of various large language models (or LLMs). It includes writing from every film nominated for Best Picture from 1950 to 2016, at least 616 episodes of The Simpsons, 170 episodes of Seinfeld, 45 episodes of Twin Peaks, and every episode of The Wire, The Sopranos, and Breaking Bad. It even includes prewritten “live” dialogue from Golden Globes and Academy Awards broadcasts. If a chatbot can mimic a crime-show mobster or a sitcom alien—or, more pressingly, if it can piece together whole shows that might otherwise require a room of writers—data like this are part of the reason why.”

Metropolitan Museum of Art Puts 490,000 High-Res Images Online & Makes Them Free to Use

Open Culture: “The Metropolitan Museum of Art has put online 492,000 high-resolution images of artistic works. Even better, the museum has placed the vast majority of these images into the public domain, meaning they can be downloaded directly from the museum’s website for non-commercial use. When you browse the Met collection and find an image…

Inside Redbox’s insane bankruptcy unwinding

Sherwood: “Ever wanted to own 46 copies of Orlando Bloom’s latest movie? What about a dozen empty Redbox DVD cases? Or maybe an entire Redbox kiosk, free with local pickup? It’s all up for grabs, thanks to Redbox’s recent demise. The chain of DVD-rental kiosks filed for bankruptcy in June after racking up close to…

Law and Technological Innovations: Three Reasons to Pause

Smith, Michael L., (September 04, 2024). 12 Belmont Law Review (Forthcoming 2025), Available at SSRN: https://ssrn.com/abstract=4946479 or http://dx.doi.org/10.2139/ssrn.4946479 – “Faced with optimistic accounts of technological innovations, businesses, law firms, and governments face pressure to rush into adopting these technologies and enjoying the increased efficiency, reduced costs, and other benefits that are promised. This essay sets…

LLMs don’t do formal reasoning and that is a HUGE problem

Marcus on AI: “A superb new article on LLMs from six AI researchers at Apple who were brave enough to challenge the dominant paradigm has just come out. Everyone actively working with AI should read it, or at least this terrific X thread by senior author, Mehrdad Farajtabar, that summarizes what they observed. One key…

FTC Sends Refunds to Consumers Who Bought Pyrex Glass Manufacturer’s Products Falsely Advertised as Made in USA

FTC: The Federal Trade Commission is sending more than $88,000 in refunds to consumers who bought Chinese-made measuring cups marketed as “Made in USA” by Instant Brands, the maker of Pyrex-brand kitchen and home products. The FTC took action against Instant Brands in 2023 charging that the company claimed that all its popular glass measuring…

Unlocking AI for All: The Case for Public Data Banks

LawFare: “The data relied on by OpenAI, Google, Meta, and other artificial intelligence (AI) developers is not readily available to other AI labs. Google and Meta relied, in part, on data gathered from their own products to train and fine-tune their models. OpenAI used tactics to acquire data that now would not work or may…

Inside Iron Mountain: It’s Time to Talk About Hard Drives

MIX: “A few years ago, archiving specialist Iron Mountain Media and Archive Services did a survey of its vaults and discovered an alarming trend: Of the thousands and thousands of archived hard disk drives from the 1990s that clients ask the company to work on, around one-fifth are unreadable. Iron Mountain has a broad customer…

Academic Journal Publishers Antitrust Litigation

Press release: “On September 12, 2024, Lieff Cabraser and co-counsel at Justice Catalyst Law filed a federal antitrust lawsuit against six commercial publishers of academic journals, including Elsevier, Springer Nature, Taylor and Francis, Sage, Wiley, and Wolters Kluwer, on behalf of a proposed class of scientists and scholars who provided manuscripts or peer review, alleging…

When A.I.’s Output Is a Threat to A.I. Itself

The New York Times – As A.I.-generated data becomes harder to detect, it’s increasingly likely to be ingested by future A.I., leading to worse results. “The internet is becoming awash in words and images generated by artificial intelligence. Sam Altman, OpenAI’s chief executive, wrote in February that the company generated about 100 billion words…

New web crawler launched by Meta last month is quietly scraping the internet for AI training data

Fortune [no paywall]: “Meta has quietly unleashed a new web crawler to scour the internet and collect data en masse to feed its AI model. The crawler, named the Meta External Agent, was launched last month according to three firms that track web scrapers and bots across the web. The automated bot essentially copies, or…

EU Proposal for an ePrivacy Regulation

“The European Commission’s proposal for a Regulation on ePrivacy aims at reinforcing trust and security in the digital world. Why a reform of ePrivacy legislation? European legislation needs to keep up with the fast pace at which IT-based services are developing and evolving. The Commission has started a major modernisation process of the data protection…