Search

Chapter 3. Using data connections

download PDF

3.1. Adding a data connection to your data science project

You can enhance your data science project by adding a connection to a data source. When you want to work with a very large data sets, you can store your data in an S3-compatible object storage bucket, so that you do not fill up your local storage. You also have the option of associating the data connection with an existing workbench that does not already have a connection.

Prerequisites

  • You have logged in to Red Hat OpenShift AI.
  • If you are using specialized OpenShift AI groups, you are part of the user group or admin group (for example, rhoai-users or rhoai-admins ) in OpenShift.
  • You have created a data science project that you can add a data connection to.
  • You have access to S3-compatible object storage.
  • If you intend to add the data connection to an existing workbench, you have saved any data in the workbench to avoid losing work.

Procedure

  1. From the OpenShift AI dashboard, click Data Science Projects.

    The Data Science Projects page opens.

  2. Click the name of the project that you want to add a data connection to.

    A project details page opens.

  3. Click the Data connections tab.
  4. Click Add data connection.

    The Add data connection dialog opens.

  5. Enter a name for the data connection.
  6. In the Access key field, enter the access key ID for your S3-compatible object storage provider.
  7. In the Secret key field, enter the secret access key for the S3-compatible object storage account you specified.
  8. In the Endpoint field, enter the endpoint of your S3-compatible object storage bucket.
  9. In the Region field, enter the default region of your S3-compatible object storage account.
  10. In the Bucket field, enter the name of your S3-compatible object storage bucket.
  11. Optional: From the Connected workbench list, select a workbench to connect.
  12. Click Add data connection.

Verification

  • The data connection that you added appears on the Data connections tab for the project.
  • If you selected a workbench, the workbench is visible in the Connected workbenches column on the Data connections tab for the project.

3.2. Deleting a data connection

You can delete data connections from your data science projects to help you remove connections that are no longer relevant to your work.

Prerequisites

  • You have logged in to Red Hat OpenShift AI.
  • If you are using specialized OpenShift AI groups, you are part of the user group or admin group (for example, rhoai-users or rhoai-admins) in OpenShift.
  • You have created a data science project with a data connection.

Procedure

  1. From the OpenShift AI dashboard, click Data Science Projects.

    The Data Science Projects page opens.

  2. Click the name of the project that you want to delete the data connection from.

    A project details page opens.

  3. Click the Data connections tab.
  4. Click the action menu () beside the data connection that you want to delete and then click Delete data connection.

    The Delete data connection dialog opens.

  5. Enter the name of the data connection in the text field to confirm that you intend to delete it.
  6. Click Delete data connection.

Verification

  • The data connection that you deleted is no longer displayed on the Data connections tab for the project.

3.3. Updating a connected data source

To use an existing data source with a different workbench, you can change the data source that is connected to your project’s workbench.

Prerequisites

  • You have logged in to Red Hat OpenShift AI.
  • If you are using specialized OpenShift AI groups, you are part of the user group or admin group (for example, rhoai-users or rhoai-admins) in OpenShift.
  • You have created a data science project, created a workbench, and you have defined a data connection.

Procedure

  1. From the OpenShift AI dashboard, click Data Science Projects.

    The Data Science Projects page opens.

  2. Click the name of the project whose data source you want to change.

    A project details page opens.

  3. Click the Data connections tab.
  4. Click the action menu () beside the data source that you want to change and then click Edit data connection.

    The Edit data connection dialog opens.

  5. In the Connected workbench section, select an existing workbench from the list.
  6. Click Update data connection.

Verification

  • The updated data connection is displayed on the Data connections tab for the project.
  • You can access your S3 data source using environment variables in the connected workbench.
Red Hat logoGithubRedditYoutubeTwitter

Learn

Try, buy, & sell

Communities

About Red Hat Documentation

We help Red Hat users innovate and achieve their goals with our products and services with content they can trust.

Making open source more inclusive

Red Hat is committed to replacing problematic language in our code, documentation, and web properties. For more details, see the Red Hat Blog.

About Red Hat

We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.

© 2024 Red Hat, Inc.