GitHub's AI Hunger: Your Private Code Is Now Training Data by Default
Microsoft's GitHub is set to implement a policy change on April 24 that allows the platform to use user interaction data for AI training. This includes inputs, outputs, and specific code snippets along with their associated context. Because the new model is opt-out, developers' contributions and interactions will automatically feed the development of GitHub's AI features unless they adjust their settings. The move highlights the increasing pressure on tech giants to source vast amounts of data to refine large language models, though it shifts the burden of protecting privacy onto the individual user.