SYNOPSIS

Input

A record stream to count double coverage in over time.

Output

A record stream with the double coverage count matrix for the previous period.

DESCRIPTION

This tool allows you to detect relations between two data sets. This is best illustrated through some examples:

OPTIONS

Duration (seconds)

Time period in seconds to count double coverage over.

A-Key

The first field key to match against.

B-Key

The second field key to match against.

Identifier Key

The field key to identify a unique user.

EXAMPLES

You are monitoring a HTTP record stream and would like to see which browsers are used on which part of your web site (to detect incompatibility issues - some browsers may never reach a certain part of your web site). Your clients are identified by their IP addresses.

This can be achieved by following these steps:

  1. Make sure your Double Coverage tool receives an input containing the fields client_ip, agent and url. The url field must previously have been derived from a HTTP record stream by the Derive tool.

  2. Select url in the A-Key field, agent in the B-Key field and client_ip in the Identifier Key field.

  3. Save the output of the Double Coverage tool in a database table.

When you analyze the database table containing the output of the Double Coverage tool, you can see numbers indicating how many clients browsed a given url with a given agent.

The data could look something like this:

timestamp 2003-07-15 10:22:24
url www.my_site.com/my_texts/introduction.php
agent Mozilla/4.0 (compatible; MSIE 5.5; Windows 98)
dc_count 11

This has the following significance:

The url www.my_site.com/my_texts/introduction.php has been hit 11 times by the agent Mozilla/4.0 (compatible; MSIE 5.5; Windows 98).

Another example is:

You are monitoring a HTTP record stream and you would like to detect client movements from one domain to another domain. Your clients are identified by their IP addresses.

This can be achieved by following these steps:

  1. Make sure your Double Coverage tool receives an input containing the fields client_ip and host.

  2. Select host in the A-Key field, host in the B-Key field and client_ip in the Identifier Key field.

  3. Save the output of the Double Coverage tool in a database table.

When you analyze the database table containing the output of the Double Coverage tool, you can see numbers indicating how many clients browsed from one host to another host.

The data could look something like this:

timestamp 2003-07-15 10:22:24
host1 www.my_site.com
host2 www.my_shop.com
dc_count 31

This has the following significance:

On 31 occasions a client went from the host www.my_site.com to the host www.my_shop.com. The clients may have moved directly from www.my_site.com to www.my_shop.com or via another host, e.g.

www.my_site.com -> www.my_information.com -> www.my_shop.com