Chapter 5. Resolved issues
This section describes notable issues that have been resolved in Red Hat OpenShift Data Science 2.4.
DATA-SCIENCE-PIPELINES-OPERATOR-294 - Scheduled pipeline run that uses data-passing might fail to pass data between steps, or fail the step entirely
A scheduled pipeline run that uses an S3 object store to store the pipeline artifacts might fail with an error such as the following:
Bad value for --endpoint-url "cp": scheme is missing. Must be of the form http://<hostname>/ or https://<hostname>/
This issue occurred because the S3 object store endpoint was not successfully passed to the pods for the scheduled pipeline run. This issue is now resolved.
RHODS-4769 - GPUs on nodes with unsupported taints cannot be allocated to notebook servers
GPUs on nodes marked with any taint other than the supported nvidia.com/gpu taint could not be selected when creating a notebook server. This issue is now resolved.
RHODS-6346 - Unclear error message displays when using invalid characters to create a data science project
When creating a data science project’s data connection, workbench, or storage connection using invalid special characters, the following error message was displayed:
the object provided is unrecognized (must be of type Secret): couldn't get version/kind; json parse error: unexpected end of JSON input ({"apiVersion":"v1","kind":"Sec ...)
The error message failed to clearly indicate the problem. The error message now indicates that invalid characters were entered.
RHODS-6950 - Unable to scale down workbench GPUs when all GPUs in the cluster are being used
In earlier releases, it was not possible to scale down workbench GPUs if all GPUs in the cluster were being used. This issue applied to GPUs being used by one workbench, and GPUs being used by multiple workbenches. You can now scale down the GPUs by selecting None from the Accelerators list.
RHODS-8939 - Default shared memory for a Jupyter notebook created in a previous release causes a runtime error
Starting with release 1.31, this issue is resolved, and the shared memory for any new notebook is set to the size of the node.
For a Jupyter notebook created in a release earlier than 1.31, the default shared memory for a Jupyter notebook is set to 64 MB and you cannot change this default value in the notebook configuration.
To fix this issue, you must recreate the notebook or follow the process described in the Knowledgebase article How to change the shared memory for a Jupyter notebook in Red Hat OpenShift Data Science.
RHODS-9030 - Uninstall process for OpenShift Data Science might become stuck when removing kfdefs
resources
The steps for uninstalling OpenShift Data Science self-managed are described in Uninstalling Red Hat OpenShift Data Science self-managed.
However, even when you followed this guide, you might have seen that the uninstall process did not finish successfully. Instead, the process stayed on the step of deleting kfdefs
resources that were used by the Kubeflow Operator. As shown in the following example, kfdefs
resources might exist in the redhat-ods-applications
, redhat-ods-monitoring
, and rhods-notebooks
namespaces:
$ oc get kfdefs.kfdef.apps.kubeflow.org -A NAMESPACE NAME AGE redhat-ods-applications rhods-anaconda 3h6m redhat-ods-applications rhods-dashboard 3h6m redhat-ods-applications rhods-data-science-pipelines-operator 3h6m redhat-ods-applications rhods-model-mesh 3h6m redhat-ods-applications rhods-nbc 3h6m redhat-ods-applications rhods-osd-config 3h6m redhat-ods-monitoring modelmesh-monitoring 3h6m redhat-ods-monitoring monitoring 3h6m rhods-notebooks rhods-notebooks 3h6m rhods-notebooks rhods-osd-config 3h5m
Failed removal of the kfdefs
resources might have also prevented later installation of a newer version of OpenShift Data Science. This issue no longer occurs in Red Hat OpenShift Data Science 2.4.
RHODS-9764 - Data connection details get reset when editing a workbench
When you edited a workbench that had an existing data connection and then selected the Create new data connection option, the edit page might revert to the Use existing data connection option before you had finished specifying the new connection details. This issue is now resolved.