Install Alauda AI Generative
Alauda AI Generative is a cloud-native component built on KServe for serving generative AI models. As an extension of the Alauda AI ecosystem, it specifically optimizes for Large Language Models (LLMs), offering essential features such as inference orchestration, streaming responses, and resource-based auto-scaling for generative workloads.
TOC
PrerequisitesRequired DependenciesOptional DependenciesInstallation NotesDownloading Cluster PluginUploading the Cluster PluginInstalling Alauda AI GenerativeEnvoy Gateway ConfigurationEnvoy AI Gateway ConfigurationKServe Gateway ConfigurationGIE(gateway-api-inference-extension) ConfigurationAlauda AI IntegrationUpgrading Alauda AI GenerativePrerequisites
Before installing Alauda AI Generative, you need to ensure the following dependencies are installed:
Required Dependencies
Alauda build of Envoy Gateway is natively integrated into ACP 4.2. For environments running earlier versions (including ACP 4.0 and 4.1), please contact Customer Support for compatibility and installation guidance.
Optional Dependencies
Installation Notes
- Required Dependencies: All three required dependencies must be installed before installing Alauda AI Generative.
- GIE Integration: If you want to use GIE, you can enable it during the installation process by selecting the "Integrated GIE" option in the Alauda AI Generative UI.
- Alauda AI Integration: If you don't need KServe Predictive AI functionality and only want to use LLM Generative AI, you can disable the "Integrated With Alauda AI" option during installation.
Downloading Cluster Plugin
Alauda AI Generative cluster plugin can be retrieved from Customer Portal.
Please contact Consumer Support for more information.
Uploading the Cluster Plugin
For more information on uploading the cluster plugin, please refer to Uploading Cluster Plugins
Installing Alauda AI Generative
-
Go to the
Administrator->Marketplace->Cluster Pluginpage, switch to the target cluster, and then deploy theAlauda AI GenerativeCluster plugin. -
In the deployment form, configure the following parameters as needed:
Envoy Gateway Configuration
Envoy AI Gateway Configuration
KServe Gateway Configuration
GIE(gateway-api-inference-extension) Configuration
Alauda AI Integration
-
Click Install to begin the installation process.
-
Verify result. You can see the status of "Installed" in the UI.
Upgrading Alauda AI Generative
- Upload the new version for package of Alauda AI Generative plugin to ACP.
- Go to the
Administrator->Clusters->Target Cluster->Functional Componentspage, then click theUpgradebutton, and you will see theAlauda AI Generativecan be upgraded.