Convert Structure to Dataset

The Convert Structure to Dataset task transforms one of Rulex Factory’s data structures into a dataset.

Users may want to convert a structure for several reasons, for example:

  • Analyze the dataset in a Data Manager to gain detailed insights. For example, users can filter rules through a formula to keep only those rules that have a specific coverage.

  • Export the resulting dataset to an external item, such as a file or a database table.

  • Produce a dataset that contains all the information included in a model, such as the weights of a neural network or the coefficients of a linear regression.
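
As an illustration of the first two use cases combined, the following minimal Python sketch filters a converted Rules dataset after it has been exported to a CSV file. It works outside Rulex Factory, and both the column names (rule_id, condition, output, covering) and the sample rows are hypothetical: the columns actually produced by the task may differ.

```python
import csv
import io

# Hypothetical export of a converted Rules structure. The column names
# and the values are made up for illustration only.
EXPORTED = """rule_id,condition,output,covering
1,age>30,>50K,0.42
2,hours<20,<=50K,0.08
3,education in {Masters},>50K,0.25
"""


def rules_with_min_covering(csv_text, threshold):
    """Return the rows whose covering is at least `threshold`."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader if float(row["covering"]) >= threshold]


selected = rules_with_min_covering(EXPORTED, 0.2)
print([row["rule_id"] for row in selected])  # prints ['1', '3']
```

Inside Rulex Factory, the same kind of filter would normally be applied with a formula in a Data Manager task, as described above.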

The task has a single tab, the Options tab.


The Options tab

The Options tab contains two panes:

  • Structure pane, where users select the structure to convert to a dataset.

  • Configuration pane, which is displayed after a structure has been chosen.

Structure pane

This pane contains a drop-down list for choosing the data structure to convert.

Available options are:

  • Association rules

  • Auto regressive models

  • Clusters

  • Cluster labels

  • Discretization cutoffs

  • Frequent itemsets

  • Frequent sequences

  • Monitor

  • Results

  • Rules

  • Models

  • PCA eigenvectors

  • Relevances

Configuration pane

The Configuration pane is displayed once a structure has been chosen.

For every structure except Rules, the pane shows the message “No parameters need to be set for this task: just compute the task by right-clicking it and selecting Compute > Compute selected”.

If a Rules structure has been chosen, the Configuration pane is populated with the following options:

  • Dataset format: the format for the output dataset. The possible values are:
    • One row for each rule (default): the resulting table contains a row for each rule.

    • One row for each term: the resulting table contains a row for each condition attribute value within each rule.

    • One row for each condition: the resulting table contains a row for each condition attribute within each rule.

  • Conditions on ordered attributes format: the required format for the conditions on ordered attributes. The possible values are:
    • a<x<b (default): values will always be displayed with greater than or less than indicators.

    • x>a, x in [a,b]: values will be displayed with greater than or less than indicators, or as a range when possible.

    • x in [0,Inf], x in [a,b]: values will always be displayed as a range, using an infinite value (Inf) for any side on which the condition has no finite bound.
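
To make the difference between the three formats for ordered attributes concrete, here is a minimal Python sketch that renders the same condition in each of them. It is not part of Rulex Factory: the function name format_ordered, the style names ("inequality", "mixed", "range") and the exact spacing and strictness of the comparison signs are assumptions made for illustration.

```python
from math import inf


def _num(value):
    """Print infinite bounds as Inf / -Inf and finite ones compactly."""
    if value == inf:
        return "Inf"
    if value == -inf:
        return "-Inf"
    return f"{value:g}"


def format_ordered(attr, low=-inf, high=inf, style="inequality"):
    """Render the condition low < attr < high in one of three styles.

    The style names are hypothetical labels for the three options:
      "inequality" -> a<x<b
      "mixed"      -> x>a, x in [a,b]
      "range"      -> x in [0,Inf], x in [a,b]
    """
    has_low, has_high = low != -inf, high != inf
    if not (has_low or has_high):
        raise ValueError("the condition needs at least one finite bound")

    as_range = f"{attr} in [{_num(low)},{_num(high)}]"
    if has_low and has_high:
        as_inequality = f"{_num(low)}<{attr}<{_num(high)}"
    elif has_low:
        as_inequality = f"{attr}>{_num(low)}"
    else:
        as_inequality = f"{attr}<{_num(high)}"

    if style == "inequality":
        return as_inequality
    if style == "mixed":
        # Use a range only when both bounds are finite.
        return as_range if (has_low and has_high) else as_inequality
    if style == "range":
        return as_range
    raise ValueError(f"unknown style: {style}")


for style in ("inequality", "mixed", "range"):
    print(style, "|", format_ordered("age", 20, 30, style),
          "|", format_ordered("age", low=20, style=style))
```

With a two-sided condition (20 and 30) the second and third styles coincide, while with a one-sided condition (only 20) the first and second styles coincide and the third one closes the range with Inf.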


Example

The following example uses the Adult dataset.

  • Create a structure with the Standard Clustering task, then add a Convert Structure to Dataset task to the flow.

  • Select Clusters in the Select the structure option, then save and compute the task.

  • The output dataset contains the results in tabular format, as they appear in the Clusters tab of the Standard Clustering task.

https://cdn.rulex.ai/docs/Factory/convert-structure-to-dataset-example-1.webp