Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Category Archives: Patent and Trademark

Google Patent Phrase Similarity Dataset

Kaggle: “This is a human rated contextual phrase to phrase matching dataset focused on technical terms from patents. In addition to similarity scores that are typically included in other benchmark datasets we include granular rating classes similar to WordNet, such as synonym, antonym, hypernym, hyponym, holonym, meronym, domain related. The dataset was used in the U.S. Patent Phrase to Phrase Matching competition. The dataset was generated with focus on the following:

  • Phrase disambiguation: certain keywords and phrases can have multiple different meanings. For example, the phrase “mouse” may refer to an animal or a computer input device. To help disambiguate the phrases we have included Cooperative Patent Classification (CPC) classes with each pair of phrases.
  • Adversarial keyword match: there are phrases that have matching keywords but are otherwise unrelated (e.g. “container section” → “kitchen container”, “offset table” → “table fan”). Many models will not do well on such data (e.g. bag of words models). Our dataset is designed to include many such examples.
  • Hard negatives: We created our dataset with the aim to improve upon current state of the art language models. Specifically, we have used the BERT model to generate some of the target phrases. So our dataset contains many human rated examples of phrase pairs that BERT may identify as very similar but in fact they may not be.
  • Each entry of the dataset contains two phrases – anchor and target, a context CPC class, a rating class, and a similarity score…”

Modern, user-friendly Patent Center to fully replace legacy Public PAIR system this summer

USPTO: “Beginning August 1, 2022, the U.S. Patent and Trademark Office’s (USPTO) Patent Center system—available to the public since 2017—will fully replace the legacy Public Patent Application Information Retrieval (Public PAIR) tool for the electronic filing and management of patent applications. The Public PAIR tool, first launched in the early 2000s, will be officially retired on July… Continue Reading

USPTO launches new Patent Public Search tool and webpage

“The United States Patent and Trademark Office (USPTO) today announced a new Patent Public Search tool that provides more convenient, remote, and robust full-text searching of all U.S. patents and published patent applications. Based on the advanced Patents End-to-End (PE2E) search tool USPTO examiners use to identify prior art, this free, cloud-based platform combines the capabilities of… Continue Reading

The Lumen database collects and analyzes legal complaints and requests for removal of online materials

Lumen.org: “Lumen collects and studies online content removal requests, providing transparency and supporting analysis of the Web’s takedown “ecology,” in terms of who sends requests, why, and to what ends. Lumen seeks to facilitate research about different kinds of complaints and requests for removal – legitimate and questionable – that are being sent to Internet publishers,… Continue Reading

Artificial Intelligence Patent Dataset

“To assist researchers and policymakers focusing on the determinants and impacts of artificial intelligence (AI) invention, OCE released two data files, collectively called the Artificial Intelligence Patent Dataset (AIPD). The first data file identifies United States (U.S.) patents issued between 1976 and 2020 and pre-grant publications (PGPubs) published through 2020 that contain one or more… Continue Reading

Patent Law: An Open-Source Casebook (Entire Book)

Janis, Mark David and Sichelman, Ted M. and Allison, John R. and Cotter, Thomas F. and Cotropia, Christopher Anthony and Karshtedt, Dmitry and Lefstin, Jeffrey A. and Rantanen, Jason and Taylor, David O. and Tu, Shine (Sean), Patent Law: An Open-Source Casebook (Entire Book) (May 6, 2021). UC Hastings Research Paper, Forthcoming, Available at SSRN:… Continue Reading

EUIPO’s TMview database expands to Chinese market

“As of … the 19 May 2021, TMview will include trade mark data made available by the China National Intellectual Property Administration (CNIPA), taking the total number of trade marks in the search tool from 62 to over 90 million from 75 participating IP Offices. Over 32 million Chinese trade marks are now available in the… Continue Reading

USPTO chief information officer most excited about new search algorithms

FedScoop – “New search algorithms for relevant prior art most excite the U.S. Patent and Trademark Office’s CIO right now. USPTO created the machine-learning algorithms to increase the speed at which patents are examined by importing relevant prior art — all information on its claim of originality — into pending applications sent to art units, said Jamie Holcombe. Filtering… Continue Reading

Pete Recommends – Weekly highlights on cyber security issues, August 15, 2020

Via LLRX – Pete Recommends – Weekly highlights on cyber security issues, August 15, 2020 – Privacy and security issues impact every aspect of our lives – home, work, travel, education, health and medical records – to name but a few. On a weekly basis Pete Weiss highlights articles and information that focus on the… Continue Reading