Lab: IGC: Create custom rules for data analysis

Information Server enables users to better understand their data. It uses pre-built and custom rules that will apply meaning and quality measurements, which are available for users of the data and interested parties.

This lab, will give you hands-on experience using Information Server's rule capabilities. In this lab, you will learn the following:

  • How to create a data rule

  • Apply the rule to a data set

  • View data that does not apply to the rule

This section is comprised of the following steps:

1. Import and view the data

Launch Firefox and click on the Launchpad bookmark. When the Information Server launchpad shows up click on the Information Governance Catalog New tile.

Log in with the username isadmin and password inf0Xerver.

The overview page will appear.

Click on the Connections tab and ensure the connection JK_ORD appears and the host is IIS-SERVER.IBM.DEMO. This connection is pre-established for convenience and was imported to the JK Orders workspace, let's take a look at it.

To find the JK Orders workspace, go to Quality tab and click on the right tile.

The workspace overview will load. Take a few moments to browse the graphics on the page and click on the Data sets link to view the data in this exercise.

Before we create new rules let's look at the two data sets that will be used in this example. Take a look at the Sales data set first.

Click on the Columns tab to view findings from the analyzer. It found many things when the data was imported, like maximum and minimum values, distinct values, format, and uniqueness.

Repeat the same for the CUSTOMERS data set.

We're now ready to create our first data rule!

2. Create a data rule

From the JK Orders workspace click on the Data rules tab.

Expand the Published Rules section, then the 01 Personal Identity section, and then the Age section to reveal the AdultInRangeNumeric rule.

Navigate to the right to show the menu. Choose the Manage in workspace option from the menu.

The rule should now appear under the All section. Navigate to the right to show the menu. Choose the Edit option from the menu.

Now we're going to edit the rule so that it flags ages that are under 18 or over 120. To do that, edit the text boxes to the right that show minage and maxage and update them with 18 and 120 respectively. The formula at the bottom should read:

age >= 18 and age < 120

Save the rule.

When you are brought back to the Data rules tab you'll notice that the new rule has an error. We need to publish the rule. To do so navigate to the right to show the menu. Choose the Publish option from the menu.

Confirm the publish action.

3. Re-analyze and view results

Go back to the JK Orders workspace and click on the Data sets links

Click on the SALES data set.

We can now apply the rule by clicking the + Add rule button.

Choose the AdultInRangeNumeric rule and click Next.

We now have to bind the rule to a specific column in our data set. Click on the age variable name on the left, this is the rule. And click on the AGE column name on the right, this is our data source. Click on the Bind button and once bound you can click Next.

We don't need to test the rule this time, we can just click on Save.

You should now see the rule in the data set view. Click on the Analyze button to restart the analysis with the new rule.

Confirm to start the analysis.

You may need to refresh your browser a few times for this step. It will take a few minutes. You'll see the state go to Running to Successful when done.

Once complete, go to the Data quality tab. You'll see that the new rule has one finding. Click on the findings to view the exception.

Scrolling to the right you'll see that there is one entry whos age is 17, this is less than the required 18 for our rule!

CONGRATULATIONS!! You have completed this lab!

Last updated