Llama 3.1 70B
128K context
Description
Llama 3.1 70B is ideal for content creation, conversational AI, language understanding, R&D, and enterprise applications.
Pricing
- Dedicated Endpoints: Calculated by the instance type and the number of GPUs, you can find the details in pricing page. You can also contact us to reserve GPUs.
- Serverless Endpoints: $0.8 / M tokens for using Llama 3.1 70B, pay as you go.
Playground
System prompt
Temperature
Max tokens
Top P