Archive for January 6th, 2024

January 6, 2024

EleutherAI

The Pile (dataset)

EleutherAI is a grass-roots non-profit artificial intelligence (AI) research group. The group, considered an open-source version of OpenAI, was formed in a Discord server in July 2020 to organize a replication of GPT-3. In early 2023, it formally incorporated as the EleutherAI Foundation, a non-profit research institute. EleutherAI began as a Discord server on July 7, 2020 under the tentative name ‘LibreAI’ before rebranding to ‘EleutherAI’ later that month, in reference to eleutheria, an ancient greek term for liberty.

On December 30, 2020, EleutherAI released The Pile, a curated dataset of diverse text for training large language models.

read more »