Nearly 12,000 API keys and passwords found in AI training dataset

Industry news
Post Reply
rbc
Secretary
Posts: 374
Joined: Mon Oct 30, 2023 1:32 am
Location: Vicksburg, MS
ISC2 Member Status: Yes
Contact:

Nearly 12,000 API keys and passwords found in AI training dataset

Post by rbc »

Close to 12,000 valid secrets that include API keys and passwords have been found in the Common Crawl dataset used for training multiple artificial intelligence models.

The Common Crawl non-profit organization maintains a massive open-source repository of petabytes of web data collected since 2008 and is free for anyone to use.

Because of the large dataset, many artificial intelligence projects may rely, at least in part, on the digital archive for training large language models (LLMs), including ones from OpenAI, DeepSeek, Google, Meta, Anthropic, and Stability.
[...]
Nearly 12,000 API keys and passwords found in AI training dataset
Robert B. Carleton + ISC2 Central Mississippi Secretary
Post Reply