Import Data to Project

There are many ways to bring your data into your project, in this section we'll cover using from the following data sources:

  • Using virtualized data

Note: The lab instructions below assume you have your project already available. If not, follow the instructions in the pre-work section to create a project.

Importing Virtualized Data

For this section we'll explore the data that is available from the virtualized views that have been created in Data Virtualization. This data may come from various data sources or tables within a single source, but will appear as a single data asset. We will add this data to an analytics project so that it can be used in subsequent modules of this workshop (i.e for data analysis and to build ML models).

Assign the data to your project

  • To launch the data virtualization tool, go the (☰) navigation menu and under the Data section click on Data virtualization.

(☰) Menu -> Collect -> Data Virtualization
  • From the data virtualization sub-menu at the top left of the page, click on the menu drop down list and choose My virtualized data. Here you should see the data you have virtualized or that you have been given access to (or that the administrator has assigned to you).

(My virtualized data
  • Select the checkbox next to the data sets you want to use in your project and click the Assign button to start importing it to your project.

Note: The name of the data assets to select may vary based on names chosen during data virtualization. The default names to select are: LOANS, APPLICANTFINANCIALDATA, APPLICANTPERSONALDATA, APPLICANTFINANCIALPERSONALDATA AND APPLICANTFINANCIALPERSONALLOANDATA

Select the data you want to import
  • In the 'Assign virtual objects' screen, choose your analytics project from the drop down list. Then click the Assign button to add the data to your project.

Assign the data to a project
  • In the Publish virtual assets to catalog pop up panel, select the Publish button to publish these assets to the catalog.

Publish to catalog
  • In the pop up panel, you will receive a confirmation that the objects have been assigned to your project. Click the Go to project button.

Data assigned to a project
  • In the project page, clicking on the Assets tab will show the virtualized tables and joined tables that are now in your project (along with other assets that are in the project).

    Do not go to the next section until you see the data assets in your project.

Conclusion

This lab shows just one of the ways to gather data for your analytics projects in Cloud Pak for Data. In this case you used data that was previously virtualized. Other ways might include: browsing the catalogs, importing flat files, or importing data from connections directly in the project.