4 items across 4 digests
AI models frequently provide correct answers while citing incorrect source materials in their responses. This source attribution problem undermines the reliability of AI systems for research and fact-checking applications.
GPT-5.5 achieves top benchmark scores but maintains a 20 percent hallucination rate at 20 percent higher API costs than previous versions. This cost-performance trade-off forces enterprises to weigh accuracy improvements against budget constraints in AI deployment decisions.
GPT-5.5 achieves top benchmark performance but costs 20 percent more than previous API pricing while maintaining frequent hallucination issues. This pricing increase signals that advanced AI capabilities will require significantly higher operational investments from businesses integrating these models.
MIT researchers developed a new training method that improves AI confidence estimates without sacrificing performance, addressing hallucination in reasoning models. This breakthrough could increase enterprise adoption of AI systems by making their outputs more reliable and trustworthy.