HPE PCAI - Behind the Curtain
In this blog
Welcome to this blog on HPE PCAI (Private Cloud AI). In this blog, we will show you what it takes to set up this environment and give you, as an infrastructure engineer, tips on how to support it.
Part of the NVIDIA AI Computing by HPE portfolio, HPE Private Cloud AI is a co-developed scalable, pre-configured, AI-ready private cloud that gives AI and IT teams powerful tools to innovate while simplifying ops and keeping your data under your control.
HPE Private Cloud AI comes in various sizes based on your use case, ranging from small—inference (rack servers, switches and storage) to XL—RAG/fine model tuning (multiple rack solution with at least 11 nodes and 670 TB of storage). Our solution at WWT's Advanced Technology Center (ATC) is a small T-shirt size, which can be shown off in a demo for both the account team and customers via this schedulable lab in the AI Proving Ground: HPE Private Cloud AI.
The demo that we would walk through for you would consist of the following:
- Deployment of solution from the ground up
- Rack/Cable
- Day 0 operations to get the system online
- Solution overview from the eyes of an infrastructure engineer
- Green Lake for Files Array
- vCenter and K8 cluster components
- Switching - NVIDIA and Aruba
- Day 2 operations: Connecting to Green Lake Portal
- Providing access to data scientists and dev engineers to deploy apps
- Deploying a chatbot or virtual assistant example
What makes this any different from any other solution out there? Why would I choose to use this? Let's examine more to see if we can answer those questions.
It's a fully integrated solution designed to allow enterprise users to start building their AI applications without having to think about sizing up their infrastructure.
In this case, the big problem is having the hardware together like either of those solutions did and combining HPE AI Essentials and NVIDIA AI Enterprise software to make it useable for your AI use case. This is what sets this solution apart from all others. The best thing about this solution is that all these tools are included from the get-go. You don't have to go out and download Spark, feast, kubeflow, ezpresto; they have workflows you can utilize directly from the solution.
The setup
Our solution consisted of four servers (three servers used in the VMware cluster to run the K8 virtual nodes and 1 DL380 as the inference node worker node), two NVIDIA switches, four Aruba switches and a couple of nodes to make up a Green Lake for Files array.
Once we completed all the networking online and firmware upgrades, we used the steps below to deploy the solution:
- Fill out the deployment sheet for both Green Lake for Files and HPE Private Cloud AI.
- Setup two different /24 networks - One for internal networks and One for an External Network to be used in deployment
- Deploy the DSC VM that will be used to deploy the infrastructure and connect to Green Lake
- Set the DSC VM management network and activate it in the Private Cloud AI portal
- Roll the rack into the Data Center
- Setup port channel from our core to the NVIDIA Switches
- HPE GreenLake cloud setup and onboarding
- Deploy HPE GreenLake for Files
- Initiate Private Cloud AI setup wizard – which will configure the Infrastructure pieces:
- Infrastructure - Management, iLO and data network info control plane node discovery, vCenter, ESXi, iLO credentials.
- Control plane VM - Control plane VM networking, credentials.
- Worker node servers - Discovery Management, iLO and data network info.
- Platform ingress IP
- Integration with HPE GreenLake for Files.
- Verify Setup is complete and access the Green Lake AI portal to start deploying AI solutions and providing access to the data scientists and developers
Once the above setup was completed, we connected to the AI Private Cloud instance via the GreenLake portal below.
From there, you can start configuring roles and allowing users access to add a virtual chatbot app or even a Jupyter Notebook.
Example of the virtual chatbot.
Conclusion
Hopefully, this blog has piqued your interest in what HPE PCAI (Private Cloud AI) can do to help speed up the time to deliver AI solutions to your customers and give the engineer who is on the hook to get their AI initiatives up and running a good idea of what they would need to support and set up.
References
https://www.hpe.com/us/en/solutions/artificial-intelligence/nvidia-collaboration.html