2024-09-12
Voice Mode Improvement
- Users can now replay the generated audio when using voice mode in playground.
- Users can now copy sample code for voice mode available in Python, JavaScript, and cURL command.
- All LLM model APIs now have voice mode enabled, including models such as Qwen72B, Llama 3.1 8B, Llama 3.1 70B, and Llama 3.1 405B. This feature allows for voice interaction across a range of advanced language models, enhancing user experience and accessibility.
Deployment Version History
- Users can now view the history of deployment versions and see what changes were made, when, and by whom. This feature helps users better understand the status and changes of the deployment over time and aids in debugging if needed.
Deployment Authentication Token Generation
- Users can now generate a random token for deployment authentication while creating a new deployment directly.
Direct API Testing Under Deployment
- Users can now try out the APIs directly under their deployments with improved API documentation and user interface. This enhancement allows for easier and more efficient API testing and integration within the deployment environment.
Deployment Status Enhancement
- Users can now view the number of pending replicas in the deployment overview page. This feature provides a clearer and quicker understanding of the current state of the deployment, helping users to easily identify any pending actions or issues.
- Users can now view the stopped and scaling states in the deployment status indicator. This improvement provides a clearer understanding of the deployment status, helping users easily monitor and manage their deployments.
Enhanced Replica Timeline with Event Visibility
- Users can now observe events such as restarts and crashes directly in the replica timeline to understand the status and availability of their services.
Job Submission and Running History
- Users can now view historical jobs by applying the 'Archived' flag in the job list filter.