cloudpakfordata-credit-risk-workshop
1.0.0
1.0.0
  • Introduction
  • Getting Started
    • Pre-work
  • Credit Risk Workshop
    • Data Connection and Virtualization
    • Import Data to Project
    • Data Visualization with Data Refinery
    • Enterprise data governance for Viewers using Watson Knowledge Catalog
    • Enterprise data governance for Admins using Watson Knowledge Catalog
    • Machine Learning with Jupyter
    • Machine Learning with AutoAI
    • Deploy and Test Machine Learning Models
    • Monitoring models with OpenScale GUI (Auto setup Monitoring)
    • Monitoring models with OpenScale (Notebook)
  • Workshop Resources
    • FAQs / Tips
  • Resources
    • IBM Cloud Pak for Data - Information and Trial
    • IBM Cloud Pak for Data - Knowledge Center
    • IBM Cloud Pak for Data - Platform API
    • IBM Cloud Pak for Data - Community
    • Watson Knowledge Catalog
    • Watson Knowledge Catalog Learning Center
    • IBM Developer
    • IBM Developer - Cloud Pak for Data
    • IBM Garage Architecture - Data
Powered by GitBook
On this page
  • Importing Virtualized Data
  • Assign the data to your project
  • Conclusion

Was this helpful?

  1. Credit Risk Workshop

Import Data to Project

PreviousData Connection and VirtualizationNextData Visualization with Data Refinery

Last updated 4 years ago

Was this helpful?

There are many ways to bring your data into your project, in this section we'll cover using from the following data sources:

  • Using virtualized data

Note: The lab instructions below assume you have your project already available. If not, follow the instructions in the pre-work section to create a project.

Importing Virtualized Data

For this section we'll explore the data that is available from the virtualized views that have been created in Data Virtualization. This data may come from various data sources or tables within a single source, but will appear as a single data asset. We will add this data to an analytics project so that it can be used in subsequent modules of this workshop (i.e for data analysis and to build ML models).

Assign the data to your project

  • To launch the data virtualization tool, go the (☰) navigation menu and under the Data section click on Data virtualization.

  • From the data virtualization sub-menu at the top left of the page, click on the menu drop down list and choose My virtualized data. Here you should see the data you have virtualized or that you have been given access to (or that the administrator has assigned to you).

  • Select the checkbox next to the data sets you want to use in your project and click the Assign button to start importing it to your project.

Note: The name of the data assets to select may vary based on names chosen during data virtualization. The default names to select are: LOANS, APPLICANTFINANCIALDATA, APPLICANTPERSONALDATA, APPLICANTFINANCIALPERSONALDATA AND APPLICANTFINANCIALPERSONALLOANDATA

  • In the 'Assign virtual objects' screen, choose your analytics project from the drop down list. Then click the Assign button to add the data to your project.

  • In the Publish virtual assets to catalog pop up panel, select the Publish button to publish these assets to the catalog.

  • In the pop up panel, you will receive a confirmation that the objects have been assigned to your project. Click the Go to project button.

  • In the project page, clicking on the Assets tab will show the virtualized tables and joined tables that are now in your project (along with other assets that are in the project).

    Do not go to the next section until you see the data assets in your project.

Conclusion

This lab shows just one of the ways to gather data for your analytics projects in Cloud Pak for Data. In this case you used data that was previously virtualized. Other ways might include: browsing the catalogs, importing flat files, or importing data from connections directly in the project.

(☰) Menu -> Collect -> Data Virtualization
(My virtualized data
Select the data you want to import
Assign the data to a project
Publish to catalog
Data assigned to a project