OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)

  • Thread starter: stared