Lab: IGC: Create custom rules for data analysis

Information Server enables users to better understand their data. It uses pre-built and custom rules that will apply meaning and quality measurements, which are available for users of the data and interested parties.

This lab, will give you hands-on experience using Information Server's rule capabilities. In this lab, you will learn the following:

  • How to create a data rule

  • Apply the rule to a data set

  • View data that does not apply to the rule

This section is comprised of the following steps:

1. Import and view the data

Launch Firefox and click on the Launchpad bookmark. When the Information Server launchpad shows up click on the Information Governance Catalog New tile.

1-iis-launchpad-new

Log in with the username isadmin and password inf0Xerver.

2-gc-login

The overview page will appear.

3-gc-landing

Click on the Connections tab and ensure the connection JK_ORD appears and the host is IIS-SERVER.IBM.DEMO. This connection is pre-established for convenience and was imported to the JK Orders workspace, let's take a look at it.

4-connections

To find the JK Orders workspace, go to Quality tab and click on the right tile.

5-workspaces

The workspace overview will load. Take a few moments to browse the graphics on the page and click on the Data sets link to view the data in this exercise.

6-workspace-overview

Before we create new rules let's look at the two data sets that will be used in this example. Take a look at the Sales data set first.

7-data-sets

Click on the Columns tab to view findings from the analyzer. It found many things when the data was imported, like maximum and minimum values, distinct values, format, and uniqueness.

8-sales

Repeat the same for the CUSTOMERS data set.

9-customers

We're now ready to create our first data rule!

2. Create a data rule

From the JK Orders workspace click on the Data rules tab.

10-data-rules

Expand the Published Rules section, then the 01 Personal Identity section, and then the Age section to reveal the AdultInRangeNumeric rule.

Navigate to the right to show the menu. Choose the Manage in workspace option from the menu.

11-manage-rule

The rule should now appear under the All section. Navigate to the right to show the menu. Choose the Edit option from the menu.

12-edit-rule

Now we're going to edit the rule so that it flags ages that are under 18 or over 120. To do that, edit the text boxes to the right that show minage and maxage and update them with 18 and 120 respectively. The formula at the bottom should read:

age >= 18 and age < 120

Save the rule.

13-rule-logic

When you are brought back to the Data rules tab you'll notice that the new rule has an error. We need to publish the rule. To do so navigate to the right to show the menu. Choose the Publish option from the menu.

14-publish

Confirm the publish action.

15-publish-confirm

3. Re-analyze and view results

Go back to the JK Orders workspace and click on the Data sets links

16-data-sets

Click on the SALES data set.

17-sales

We can now apply the rule by clicking the + Add rule button.

18-rules

Choose the AdultInRangeNumeric rule and click Next.

19-select-rule

We now have to bind the rule to a specific column in our data set. Click on the age variable name on the left, this is the rule. And click on the AGE column name on the right, this is our data source. Click on the Bind button and once bound you can click Next.

20-bind

We don't need to test the rule this time, we can just click on Save.

21-test

You should now see the rule in the data set view. Click on the Analyze button to restart the analysis with the new rule.

22-analyze

Confirm to start the analysis.

23-analyze-confirm

You may need to refresh your browser a few times for this step. It will take a few minutes. You'll see the state go to Running to Successful when done.

24-running

Once complete, go to the Data quality tab. You'll see that the new rule has one finding. Click on the findings to view the exception.

25-findings

Scrolling to the right you'll see that there is one entry whos age is 17, this is less than the required 18 for our rule!

26-findings-view

CONGRATULATIONS!! You have completed this lab!