Researchers develop auditing framework to measure LLM agent skill impact
Academic paper introduces Counterfactual Trace Auditing to evaluate how skills change LLM agent behavior beyond simple pass/fail metrics, enabling deeper accountability in deployed accounting agents.