Chapter 1. Architecture of OpenShift Data Science
Red Hat OpenShift Data Science is a fully Red Hat managed cloud service that is available as an Add-on to Red Hat OpenShift Dedicated and to Red Hat OpenShift Service on Amazon Web Services (ROSA).
OpenShift Data Science integrates the following components and services:
At the service layer:
- OpenShift Data Science dashboard
- A customer-facing dashboard that shows available and installed applications for the OpenShift Data Science environment as well as learning resources such as tutorials, quick start examples, and documentation. You can also access administrative functionality from the dashboard, such as user management, cluster settings, and notebook image settings. In addition, data scientists can create their own projects from the dashboard. This enables them to organize their data science work into a single project.
- Model serving
- Data scientists can deploy trained machine-learning models to serve intelligent applications in production. After deployment, applications can send requests to the model using its deployed API endpoint.
- Data science pipelines
- Data scientists can build portable machine learning (ML) workflows with data science pipelines, using Docker containers. This enables your data scientists to automate workflows as they develop their data science models.
- Jupyter (Red Hat managed)
- A Red Hat managed application that allows data scientists to configure their own notebook server environment and develop machine learning models in JupyterLab.
At the management layer:
- The Red Hat OpenShift Data Science operator
- A meta-operator that deploys and maintains all components and sub-operators that are part of OpenShift Data Science.
- Monitoring services
- Alertmanager, OpenShift Telemetry, and Prometheus work together to gather metrics from OpenShift Data Science and organize and display those metrics in useful ways for monitoring and billing purposes. Alerts from Alertmanager are sent to PagerDuty, responsible for notifying Red Hat of any issues with your managed cloud service.
When you install the OpenShift Data Science Add-on in the Cluster Manager, the following new projects are created:
-
The
redhat-ods-operator
project contains the OpenShift Data Science operator. -
The
redhat-ods-applications
project installs the dashboard and other required components of OpenShift Data Science. -
The
redhat-ods-monitoring
project contains services for monitoring and billing. -
The
rhods-notebooks
project is where notebook environments are deployed by default.
You or your data scientists must create additional projects for the applications that will use your machine learning models.
Do not install independent software vendor (ISV) applications in namespaces associated with OpenShift Data Science Add-ons unless you are specifically directed to do so on the application’s card on the dashboard.