Google Accused of Letting OpenAI Use YouTube Videos to Train GPT-4 - In an effort to secure high-quality data to train their AI models, AI companies such as OpenAI, Google, and Meta have used tactics that are considered unclear. A New York Times report states that OpenAI reportedly transcribed more than a million hours of YouTube videos to apply the data to train its most advanced large language model (LLM), GPT-4. Reportedly, OpenAI developed the Whisper audio transcription model, which helps companies extract data from YouTube videos. The NY Times reports that OpenAI knew that this method might be subject to scrutiny, but they went ahead with it anyway because they believed it was fair use. Interestingly, Google, the owner of YouTube, is also suspected of engaging in similar practices in its AI models, thereby violating creators' copyrights, quoted from Neowin. The NY Times report aligns with The Information's report, which highlighted that
Knowledge Karomah Laduni & News