K

KeyAudit

· ·infrastructure·social-engineering

Microsoft finds prompt injection flaw in Anthropic's Claude Code GitHub Action

Microsoft researchers have disclosed a prompt injection vulnerability in Anthropic's Claude Code GitHub Action, which could allow attackers to steal credentials from CI/CD pipelines. The attack leverages malicious instructions hidden in GitHub issues, pull requests, or comments that the AI agent processes. Microsoft demonstrated the attack by creating a workflow that bypassed Claude's safety protections, tricking the AI into reading and altering sensitive credentials. Anthropic patched the flaw on May 5 with version 2.1.128 after Microsoft reported it via HackerOne. The finding highlights the security risks of AI coding agents operating in environments with access to sensitive data, as natural language becomes executable code.

Key facts

  • Prompt injection flaw in Claude Code GitHub Action exposed credentials in CI/CD pipelines.
  • Attack hidden malicious instructions in GitHub issues, pull requests, or comments.
  • Microsoft demonstrated bypassing Claude's safety protections to read and alter credentials.
  • Anthropic patched the vulnerability on May 5 with version 2.1.128.
  • Microsoft warns natural language is now executable code; treat untrusted inputs as hostile.

← Back to list