Perplexity Outperforms OpenAI, Anthropic with ‘Sonar’ API

Sonar has beaten its rivals on speed, user satisfaction, and factuality and readability of search results

4
Perplexity Outperforms OpenAI, Anthropic with 'Sonar' GenAI
Generative AILatest News

Published: February 14, 2025

Luke Williams

January was significant in the evolution of AI-powered search, with the unveiling of the latest version of Sonar, Perplexity’s proprietary search API. This week, Sonar was made available to all Perplexity Pro users.

Built on the foundation of Llama 3.3 70B, Sonar has already demonstrated remarkable success in combining speed, accuracy, and enhanced user experience. The release represents the culmination of months of intensive development and testing, delivering a search solution that could turbocharge how users find and interact with information.

Unprecedented Performance, Lightning Speed

Initial user feedback and extensive A/B testing have shown that Sonar’s capabilities surpass models in its class like GPT-4o and Claude 3.5 Sonnet. What distinguishes Sonar is its extraordinary ability to deliver this performance at an astounding speed of 1,200 tokens per second – nearly 10 times faster than comparable solutions like Gemini 2.0 Flash.

Image from Perplexity

This breakthrough in speed, made possible through the partnership with Cerebras inference infrastructure, enables near-instantaneous answer generation. Early adopters have reported that the impact on user experience is transformative, with response times that match the speed of human thought and enable more natural, flowing interactions.

Superior User Satisfaction Through Enhanced Capabilities

The development team’s commitment to user satisfaction has driven two key optimisations in Sonar’s development, both of which have received positive feedback from early users:

Enhanced Factuality: Sonar excels at grounding answers in verified search results; skillfully handling conflicting information and filling information gaps with unerring accuracy. The model employs advanced verification algorithms to cross-reference information across multiple sources, ensuring the highest standard of factual accuracy.

Optimised Readability: The model has been fine-tuned to deliver concise yet comprehensive answers, utilising intelligent markdown formatting to present information in easily digestible formats. Sonar’s responses are structured to highlight key information while maintaining natural language flow, making complex information accessible and engaging without sacrificing depth or nuance.

Image from Perplexity

Since its release, Sonar has consistently outperformed GPT-4o mini and Claude 3.5 Haiku across all key metrics, matched or exceeded Claude 3.5 Sonnet’s performance, and approached GPT-4o’s capabilities while operating at a fraction of the cost. The model has demonstrated superior performance on academic benchmarks; showcasing its versatility and reliability across diverse use cases.

How Might Business benefit?

  1. Real-time Customer Support: Perplexity could improve the accuracy of product information retrieval, multilingual support capabilities, automated ticket categorisation and routing, and sophisticated customer sentiment analysis with response optimisation.
  2. Research and Analysis: Sonar displays advanced capabilities in rapid market research synthesis, competitive intelligence gathering, real-time data verification, trend identification and analysis, patent and intellectual property research, and comprehensive academic literature review and summarisation. The speed and accuracy of these processes should allowed teams to focus more on strategic decision-making rather than data gathering.
  3. Content Creation and Management: Content teams using Sonar could benefit from efficiency gains in automated content summarisation, SEO-optimised writing assistance, technical documentation generation, multi-format content adaptation, brand voice consistency checking, and content localisation support.
  4. Decision Support: Implemented and used correctly, Sonar has the potential to transform business decision-making processes through quick fact-checking for meetings, real-time information verification, trend analysis and insights generation, risk assessment and mitigation, regulatory compliance checking, and market opportunity identification. The speed of information retrieval will be particularly valuable in time-sensitive situations.

Implementation and Availability

Following last week’s launch, Perplexity Pro users can now access Sonar through their account settings, where it can be set as the default search model. For enterprise applications, Sonar is also available through Sonar’s API*, offering seamless integration into existing workflows and applications.

The Future of Search?

The successful launch of Sonar represents more than just an incremental improvement in search technology – it’s proving to be a fundamental shift in how we interact with and retrieve information.

Image from Perplexity

By combining impressive speed with what Perplexity calls “increased factuality and readability”, Sonar is already setting new standards for AI-powered search. As Perplexity gathers more user feedback and continues to refine and enhance its capabilities, it will further expand the limits of what’s possible in information retrieval and processing.

By combining impressive speed with what Perplexity calls “increased factuality and readability”, Sonar is already setting new standards for AI-powered search, and as Perplexity gathers more user feedback and continues to refine and enhance its capabilities, it will further expand the limits of what’s possible in information retrieval and processing.

The integration with Cerebras infrastructure suggests that even faster processing speeds could be on the horizon, while the model’s foundation on Llama 3.3 70B provides a robust base for future improvements.

For organisations and individuals seeking more efficient ways to process and analyse information, Sonar’s launch marks an intriguing development in the rapidly evolving landscape of AI search technology.

Natural Language Processing
Featured

Share This Post