Category Archives: Knowledge Management

CancerDB: Datasets about Cancer

by Sabrina I. Pacifici on Jul 28, 2024

“CancerDB is a public domain blog assembling searchable key datasets on cancer for dependable models. The focus is on cancer, cancer preventions and treatments, but the database also includes datasets on closely connected things–from cancer types to organizations and more. CancerDB is for two groups of people: Cancer researchers. CancerDB is organized big data to… Continue Reading

Our World in Data

by Sabrina I. Pacifici on Jul 28, 2024

Research and data to make progress against the world’s largest problems – “Poverty, disease, hunger, climate change, war, existential risks, and inequality: The world faces many great and terrifying problems. It is these large problems that our work at Our World in Data focuses on. Thanks to the work of thousands of researchers around the… Continue Reading

What is The Scale of Life?

by Sabrina I. Pacifici on Jul 28, 2024

Everyday Life in Real Time – “Our site is a “real-time” visualization of the relative scale of different life events and natural phenomena (details on what real-time means below). You can select from various categories, time periods, and some unique units of measure that we created in the dropdowns to modify the counter lists. Each… Continue Reading

A new tool for copyright holders can show if their work is in AI training data

by Sabrina I. Pacifici on Jul 28, 2024

MIT Technology Review [unpaywalled]: “Since the beginning of the generative AI boom, content creators have argued that their work has been scraped into AI models without their consent. But until now, it has been difficult to know whether specific text has actually been used in a training data set. Now they have a new way… Continue Reading

From Burnout to Balance: AI-Enhanced Work Models

by Sabrina I. Pacifici on Jul 25, 2024

Pluralistic: “A new research report from the Upwork Research Institute offers a look into the bizarre situation unfolding in workplaces where bosses have been conned into buying AI and now face the challenge of getting it to work as advertised:” Research by The Upwork Research Institute reveals that 71% of full-time employees are burned out… Continue Reading

When scientific citations go rogue: Uncovering ‘sneaked references’

by Sabrina I. Pacifici on Jul 25, 2024

Via LLRX – When scientific citations go rogue: Uncovering ‘sneaked references’ – Reading and writing articles published in academic journals and presented at conferences is a central part of being a researcher. When researchers write a scholarly article, they must cite the work of peers to provide context, detail sources of inspiration and explain differences in… Continue Reading

Microsoft researchers are teaching AI to read spreadsheets

by Sabrina I. Pacifici on Jul 25, 2024

Spreadsheet LLM – Encoding Spreadsheets for Large Language Models: “Spreadsheets are characterized by their extensive two-dimensional grids, flexible layouts, and varied formatting options, which pose significant challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs’ powerful understanding and reasoning capability on spreadsheets.… Continue Reading

News homepages, archived

by Sabrina I. Pacifici on Jul 24, 2024

Data is Plural: “Since launching in March 2022, homepages.news has archived millions of screenshots, performance audits, robots.txt files, accessibility trees, and hyperlink lists from the homepages of 1,100+ news sites. The open-source project, run by journalist Ben Welsh, provides bulk data for each of those assets. The screenshots themselves are stored on the Internet Archive;… Continue Reading

Woefully Insufficient Publisher Policies on Author AI Use Put Research Integrity at Risk

by Sabrina I. Pacifici on Jul 24, 2024

The Scholarly Kitchen: “There is broad consensus in scholarly publishing that AI tools will make the task of ensuring the integrity of the scientific record a Herculean task. However, it seems that many publishers are still struggling to figure out how to address the new issues and challenges that these AI tools present. Current publisher… Continue Reading

AI trained on AI garbage spits out AI garbage

by Sabrina I. Pacifici on Jul 24, 2024

MIT Technology Review: “AI models work by training on huge swaths of data from the internet. But as AI is increasingly being used to pump out web pages filled with junk content, that process is in danger of being undermined. New research published in Nature shows that the quality of the model’s output gradually degrades… Continue Reading

webXray

by Sabrina I. Pacifici on Jul 23, 2024

Wired [unpaywalled]- This Machine Exposes Privacy Violations. A former Google engineer has built a search engine, WebXray, that aims to find illicit online data collection and tracking—with the goal of becoming “the Henry Ford of tech lawsuits.”…It’s a search engine for rooting out specific privacy violations anywhere on the web. By searching for a specific… Continue Reading

Support beSpacific

Research updates provided daily since 2002, with an emphasis on primary sources.
Subscribe to our Mailing List
Follow beSpacific
Searchable Database – Over 45,000 Postings

Searchable database of over 45,000 postings!
Awards for BeSpacific

American Bar Association

BeSpacific: “No one better has her finger on the pulse of the legal information world than Sabrina Pacifici, law librarian and author of the blog BeSpacific,” writes blogger Robert Ambrogi. “Launched in 2002, BeSpacific is one of the longest-running legal blogs and, remarkably, Sabrina seems more prolific today than ever. She posts multiple items every day, covering the gamut of law, technology and knowledge discovery and topics ranging from cybersecurity to legal research to government regulation to civil liberties to IP and more. For me, BeSpacific is one of my daily must-reads and has been for 14 years straight.”

Expert Institute Award for Best Legal Tech Blog 2016, 2017 and 2018
BeSpacific - 3rd Place
Subjects

Pages
LLRX

Sabrina is also the solo Editor, Publisher and Founder of LLRX.com® – Legal, technology and knowledge discovery resources on the “moving edge” for Librarians, Lawyers, Researchers, Academic and Public Interest Communities – launched in 1996.
Archives – 2002 to Present
Archives – 2002 to Present
Calendar

September 2024

M T W T F S S

« Aug

1

2 3 4 5 6 7 8

9 10 11 12 13 14 15

16 17 18 19 20 21 22

23 24 25 26 27 28 29

30

September 2024
M	T	W	T	F	S	S
« Aug
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30