Post #51

@perplexity_ai

Perplexity

Views4,780Post view count

PostedOct 410/04/2023, 09:56 PM

Post content

Hey, everyone! Introducing pplx-api, our LLM API which serves Mistral and Llama2 models with blazing speed and throughput. pplx-api is in public beta for our Pro subscribers! We partnered with NVIDIA and AWS to build our proprietary inference. Read our blog post to learn more. Key features of pplx-api: - Up to 2.9x lower latency than Replicate & 3.1x lower than Anyscale - Battle-tested infra, serving 1B tokens in our prod environment daily - One stop shop for open-source LLMs Read the API specs here: https://docs.perplexity.ai/reference/post_chat_completions Ideal for developers seeking to incorporate open-source LLMs into their products or projects, our API offers fast inference speeds without requiring extensive C++/CUDA knowledge or GPU access. Subscribe to Perplexity Pro to try it out: https://perplexity.ai/pro