“Hi Umma — Can you analyze the Netflix Lemur GitHub repo for any vulnerabilities, security risks, etc.? Please go file-by-file to perform your analysis.”
Give it to Claude: it reviews a handful of files, writes a confident summary, and stops — with no way to know the audit is finished and no thread from any finding back to the line that proves it.