Via Data Machina via Data is Plural – 1 million ChatGPT conversations. “The WildChat Dataset, constructed by Wenting Zhao et al., “is a corpus of 1 million real-world user-ChatGPT interactions, characterized by a wide range of languages and a diversity of user prompts.” The researchers, primarily affiliated with Cornell and the Allen Institute for AI, built it “by offering free access to ChatGPT and GPT-4 in exchange for consensual chat history collection.” Each of the 1 million rows in the dataset represents a conversation and provides its text, main language, timestamp of its conclusion, underlying model used, moderation results, inferred country, and more.”
Sorry, comments are closed for this post.