datastage-standalone-workshop
  • Introduction
  • Getting Started
    • Lab: Access the environment
    • Lab: DataStage: Read and write to file (CSV)
    • Lab: DataStage: Read from a database (Db2 on IBM Cloud)
    • Lab: DataStage: Read and analyze data from multiple tables
    • Lab: DataStage: Read JSON from a REST call
    • Lab: DataStage: Import COBOL copybook and read EBCIDIC data
    • Lab: DataStage: Configure a Git server (BitBucket)
    • Lab: IGC: Create custom rules for data analysis
  • Resources
    • Additional resources
  • Survey
    • Feedback survey
  • Instructors
    • About this workshop
Powered by GitBook
On this page
  • 1. Import and view the data
  • 2. Create a data rule
  • 3. Re-analyze and view results

Was this helpful?

Export as PDF
  1. Getting Started

Lab: IGC: Create custom rules for data analysis

PreviousLab: DataStage: Configure a Git server (BitBucket)NextAdditional resources

Last updated 4 years ago

Was this helpful?

Information Server enables users to better understand their data. It uses pre-built and custom rules that will apply meaning and quality measurements, which are available for users of the data and interested parties.

This lab, will give you hands-on experience using Information Server's rule capabilities. In this lab, you will learn the following:

  • How to create a data rule

  • Apply the rule to a data set

  • View data that does not apply to the rule

This section is comprised of the following steps:

1. Import and view the data

Launch Firefox and click on the Launchpad bookmark. When the Information Server launchpad shows up click on the Information Governance Catalog New tile.

Log in with the username isadmin and password inf0Xerver.

The overview page will appear.

Click on the Connections tab and ensure the connection JK_ORD appears and the host is IIS-SERVER.IBM.DEMO. This connection is pre-established for convenience and was imported to the JK Orders workspace, let's take a look at it.

To find the JK Orders workspace, go to Quality tab and click on the right tile.

The workspace overview will load. Take a few moments to browse the graphics on the page and click on the Data sets link to view the data in this exercise.

Before we create new rules let's look at the two data sets that will be used in this example. Take a look at the Sales data set first.

Click on the Columns tab to view findings from the analyzer. It found many things when the data was imported, like maximum and minimum values, distinct values, format, and uniqueness.

Repeat the same for the CUSTOMERS data set.

We're now ready to create our first data rule!

2. Create a data rule

From the JK Orders workspace click on the Data rules tab.

Expand the Published Rules section, then the 01 Personal Identity section, and then the Age section to reveal the AdultInRangeNumeric rule.

Navigate to the right to show the ⋯ menu. Choose the Manage in workspace option from the menu.

The rule should now appear under the All section. Navigate to the right to show the ⋯ menu. Choose the Edit option from the menu.

Now we're going to edit the rule so that it flags ages that are under 18 or over 120. To do that, edit the text boxes to the right that show minage and maxage and update them with 18 and 120 respectively. The formula at the bottom should read:

age >= 18 and age < 120

Save the rule.

When you are brought back to the Data rules tab you'll notice that the new rule has an error. We need to publish the rule. To do so navigate to the right to show the ⋯ menu. Choose the Publish option from the menu.

Confirm the publish action.

3. Re-analyze and view results

Go back to the JK Orders workspace and click on the Data sets links

Click on the SALES data set.

We can now apply the rule by clicking the + Add rule button.

Choose the AdultInRangeNumeric rule and click Next.

We now have to bind the rule to a specific column in our data set. Click on the age variable name on the left, this is the rule. And click on the AGE column name on the right, this is our data source. Click on the Bind button and once bound you can click Next.

We don't need to test the rule this time, we can just click on Save.

You should now see the rule in the data set view. Click on the Analyze button to restart the analysis with the new rule.

Confirm to start the analysis.

You may need to refresh your browser a few times for this step. It will take a few minutes. You'll see the state go to Running to Successful when done.

Once complete, go to the Data quality tab. You'll see that the new rule has one finding. Click on the findings to view the exception.

Scrolling to the right you'll see that there is one entry whos age is 17, this is less than the required 18 for our rule!

CONGRATULATIONS!! You have completed this lab!

1. Import and view the data
2. Create a data rule
3. Re-analyze and view results
1-iis-launchpad-new
2-gc-login
3-gc-landing
4-connections
5-workspaces
6-workspace-overview
7-data-sets
8-sales
9-customers
10-data-rules
11-manage-rule
12-edit-rule
13-rule-logic
14-publish
15-publish-confirm
16-data-sets
17-sales
18-rules
19-select-rule
20-bind
21-test
22-analyze
23-analyze-confirm
24-running
25-findings
26-findings-view