Build and Solve

Warning

This task is available on specific request only. Contact us for specific pricing.

The Build/Solve task constructs and solves an optimization problem. This task requires two input files:

a dataset describing the problem’s features. It doesn’t need a specific structure, but it must contain one row for each possible solution.
a configuration file defining the constraints of the problem. A full description of the configuration file, along with its features and syntax, is provided in the corresponding section.

Once the configuration file has been prepared following the provided guidelines, the task itself has a simple configuration layout, consisting of the Options tab only.

Warning

If the problem’s features file doesn’t contain an empty column for the solution, the task raises an error during computation.
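As a minimal sketch of the requirement above, the following Python snippet builds a features file with one row per candidate solution and an empty solution column. The column names, the sample routes, and the choice of CSV as output format are assumptions made for illustration; they are not prescribed by the task.

```python
import csv
import io

# Hypothetical column names; "Moved" plays the Solution role in this sketch.
COLUMNS = ["Material", "Source", "Dest", "Demand", "Moved"]

# One row per candidate solution (here: one possible shipment route).
ROUTES = [
    ("steel", "plant_a", "depot_1", 40),
    ("steel", "plant_b", "depot_1", 40),
    ("wood", "plant_a", "depot_2", 15),
]

def build_features_csv(routes):
    """Return CSV text with an empty solution column for every row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(COLUMNS)
    for material, source, dest, demand in routes:
        # The last field is left empty: the optimization is expected to fill it.
        writer.writerow([material, source, dest, demand, ""])
    return buffer.getvalue()

features_csv = build_features_csv(ROUTES)
print(features_csv)
```

The resulting text can be saved to a file and imported into the flow like any other dataset.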

The Options tab

The Build/Solve task requires two input tasks: one containing the problem’s features, and another containing the optimization constraints. When opening the Build/Solve task, the Options tab is the only tab available.

It is divided into two areas:

in the left one, users must define the tasks containing the problem’s description and the configuration file. This is possible through the following options:
- Rule imported from task: specify the task containing the file with the constraints’ definition.
- Data imported from task: specify the task containing the file with the details which make up the problem.
in the area located on the right, users can customize the optimization features through the following options:
- Optimization mode: specify the solution’s outcome. The possible values are:
  
  Minimize (default)
  
  Maximize
- Maximum execution time (seconds): specify the execution’s time limit in seconds. If not specified, no limits are set.
- Value of priority for stopping iteration (where 0 means no stopping value): the priority value at which the loop over the priorities stops. If not specified, the loop doesn’t stop.
- Do not evaluate formula post optimization: if selected, the formulas with the after value in the Application column of the configuration file are not evaluated when the priority evaluation stops.
- Save constraints in cluster: if selected, the sparse input matrix will be saved in a cluster structure. This structure can be visualized by importing it through an Import from Task task.
- Add auxiliary rows: if selected, auxiliary rows will be added to the output dataset. The auxiliary rows provide information on the auxiliary variables, which have been defined in the configuration file and added during the optimization process.
- Solver used for optimization: select the solver which will be used for the optimization operation. Possible solvers are:
  - Coin-Or
  - Symphony
  - Naive (Rulex proprietary solver; it is faster than the others, but it provides a less accurate result)
- Stop at first feasible solution: if selected, the optimization operation stops as soon as a solution satisfying all the constraints is found. The solution found might not be the best one.
- Add feasible column in dataset: if selected, a binary column called ‘Feasible’ is added to the dataset. The value ‘True’ indicates that the solution is feasible, while the value ‘False’ indicates that the solution is infeasible.

The constraints configuration file

The configuration file must follow a strict setup: it can have any format (the most commonly used is MS Excel), and it must have two sheets, each containing specific information:

the first sheet contains the variables definition,
the second sheet contains the constraints definition.

Warning

The sheets must follow the order listed above, otherwise the optimization operation will not produce meaningful results.

Tip

The two sheets don’t need to have a specific name, so users can choose the one which fits better according to their needs.

Sheet 1 - Variables definition

The first sheet of the configuration file defines the variables. The system checks whether the specified attributes satisfy the specified features. The more variables are specified and described, the more precise and reliable the optimization output will be. The sheet name can be customized by the user. The first sheet must contain the following columns:

Attribute: the name of the attribute which is being defined as a variable. Specify also the attributes which need to be added to the dataset and defined in the Formula column.
Role: the role the corresponding attribute is going to play in the optimization process. The possible values are:
- Key: the task verifies that the attribute is a key. The task checks that each row of the key attribute has a different value. More than one attribute can be specified.
- Input: attributes containing important information to solve the optimization problem.
- Solution (mandatory): variable representing the solution. This role must be specified.
- Cost: the cost associated with each solution. It indicates the cost of a single unit of the Solution.
- Priority: the priority value associated to each row. The corresponding attribute’s type must be integer, with positive values only, where the value 1 indicates the highest priority, 2 a priority lower than 1, and so on.
- Type: the type of the solution generated by the task. If this role is not specified, the type is taken from the Solution attribute. As the Solution attribute is usually empty, its type can be set using a Data Manager task placed before the Build/Solve task and linked to it. The values in the corresponding attribute can be:
  
  0: continuous solution
  
  1: integer solution
  
  2: binary solution
- Minimum: the solution’s minimum value. If not specified, its value is 0.
- Maximum: the solution’s maximum value. If not specified, its value is infinite.
Distinct (optional): the system checks that the values of the attribute specified in the Attribute column are constant for each value of the attribute specified in the Distinct column. More than one attribute can be specified, in which case the system checks that the attribute is constant within each group of associated values. If the values are not constant, the system raises an error.
Formula (optional): any Data Manager formula can be inserted here, as well as the IF+THEN+ELSE rule, to fill the attribute named in the Attribute column with the results of the formula. This field can be used to define new attributes that must be added to the problem’s dataset when they are not already present. These variables depend on the inputs defined in the Role column.
Application (optional): if a sequence of formulas must be applied, this column specifies when each formula is evaluated. The possible values are:
- (missing value): if the value is left empty, the evaluation is performed only once, before starting the optimization.
- before: the variable-formula is calculated at each iteration, before the optimization process.
- after: the variable-formula is calculated after the end of the optimization process.
Description (optional): a description of the defined variable in natural language. Users can add any description to better understand the variable’s features.
Priority (optional): as the priority can be dynamic, this column contains integer numbers indicating that the formulas and the constraints in the corresponding row must be calculated when the Build/Solve task is evaluating the specified priority.
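The checks described for the Key role and the Distinct column can be pictured with the short Python sketch below. It is only an illustration of the two rules as read from the descriptions above, not the task’s actual implementation: the function names are hypothetical, and treating several Key attributes as one combined key is an assumption.

```python
from collections import defaultdict

def check_key(rows, key_attrs):
    """Raise ValueError if two rows share the same combination of key values."""
    seen = set()
    for row in rows:
        key = tuple(row[a] for a in key_attrs)
        if key in seen:
            raise ValueError(f"duplicate key {key}")
        seen.add(key)

def check_distinct(rows, attribute, group_attrs):
    """Raise ValueError if `attribute` varies within a group of `group_attrs`."""
    values = defaultdict(set)
    for row in rows:
        group = tuple(row[a] for a in group_attrs)
        values[group].add(row[attribute])
    for group, found in values.items():
        if len(found) > 1:
            raise ValueError(f"{attribute} is not constant for {group}")

# Sample rows with hypothetical values.
rows = [
    {"Material": "steel", "Source": "plant_a", "Dest": "depot_1", "Demand": 40},
    {"Material": "steel", "Source": "plant_b", "Dest": "depot_1", "Demand": 40},
    {"Material": "wood", "Source": "plant_a", "Dest": "depot_2", "Demand": 15},
]
check_key(rows, ["Material", "Source", "Dest"])        # passes: all triples differ
check_distinct(rows, "Demand", ["Material", "Dest"])   # passes: Demand constant per pair
```

Both calls return silently on the sample rows; a repeated key or a Demand value that changes within a Material and Dest pair would raise an error, mirroring the behavior described for the task.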

Sheet 2 - Constraints definition

The second sheet defines the constraints which must be taken into consideration when the task performs the analysis. As previously said, the sheet’s name can be customized by the user, but it must contain the following columns:

Constraint: definition of the constraint using Rulex syntax. Two types of constraints can be added, depending on their aim:
- variable’s definition: as there are some variables which depend on the solution, and are calculated during the analysis, these constraints define the new attribute starting from the solution. These constraints must be created using the define keyword in the rule. Once created, the defined variable can be used in other constraints.
- constraints: they are constraints which are applied to the existing variable-solutions or to solution dependent variables. These constraints must be created using the let keyword in the rule.
Description (optional): a description of the constraint in natural language. Users can add any description to better understand the constraint’s features.

More information on keywords and on the constraints’ syntax is provided in the paragraph below.

Constraints syntax

The following keywords must be taken into consideration while writing a constraint, as each one of them gives specific directions to the system to calculate the best solution:

define: it introduces an auxiliary variable definition.
let: it introduces a constraint.
when: it works as a filter that specifies the object’s features. All the Data Manager conditions can be used.
solution: it is used with a formula.
foreach: it is used to check a group’s values.
forevery: when it is used to check numerical attributes, it emulates the behavior of a moving window made of the number of values specified in the overlap parameter of the chosen attribute. When it is used to check nominal attributes, it evaluates all the possible combinations, without repeating them: for example, if the combination A, B has already been evaluated, the combination B, A won’t be taken into consideration. The syntax is forevery + length of window + attribute used for moving window + overlap + number of rows to shift over.
overlap: it specifies the shift values of the attributes checked with the forevery parameter. Its values must be numerical only.
cost: it is the cost related to the corresponding constraint. When a define statement is present, it is the unit cost of the auxiliary variable. In the other cases, it is the cost of the constraint that is added to the overall cost when the constraint is violated. A Data Manager formula can be inserted.
minimum: it is the auxiliary variable’s minimum value. A Data Manager formula can be inserted.
maximum: it is the auxiliary variable’s maximum value. A Data Manager formula can be inserted.
tol: it indicates the allowed tolerance, that is, how much the constraint can be violated.
type: the auxiliary variable’s type. It can be defined with a Data Manager formula or with the following values:
- 0: continuous type.
- 1: integer type.
- 2: binary type.
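The two behaviors described for forevery can be approximated in plain Python as follows. This is a sketch of the general ideas (a moving window over ordered values, and unordered combinations of nominal values), not of the Rulex evaluation itself; the function names and the `length` and `shift` parameters are assumptions introduced for illustration.

```python
from itertools import combinations

def moving_windows(values, length, shift):
    """Yield consecutive windows of `length` items, advancing `shift` items each time."""
    if length <= 0 or shift <= 0:
        raise ValueError("length and shift must be positive")
    for start in range(0, len(values) - length + 1, shift):
        yield values[start:start + length]

def unordered_pairs(labels):
    """Return an iterator over each pair of distinct labels, ignoring order."""
    return combinations(sorted(set(labels)), 2)

# Numerical case: windows of three values, shifted by one position.
print(list(moving_windows([1, 2, 3, 4, 5], length=3, shift=1)))

# Nominal case: (A, B) is produced once, (B, A) is never produced.
print(list(unordered_pairs(["A", "B", "C"])))
```

In the nominal case, itertools.combinations already guarantees that a pair is produced only once regardless of order, which matches the A, B versus B, A example given for forevery.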

Important

While writing the constraint, the following keyword order must be followed in the string:

when
foreach
forevery
define or let
all the other keywords left
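The ordering rule above can be checked mechanically before a constraint is pasted into the configuration file. The following Python sketch is a hypothetical helper, not part of Rulex: it ignores quoted attribute names, considers only the keywords listed in its two tables (solution and overlap are left out because their position is not stated here), and reports whether they appear in the required order.

```python
import re

# Rank of the leading keywords; "define" and "let" share the same position.
LEADING = {"when": 0, "foreach": 1, "forevery": 2, "define": 3, "let": 3}
# Keywords assumed to belong to "all the other keywords left".
TRAILING = {"cost", "minimum", "maximum", "tol", "type"}

def keyword_sequence(constraint):
    """Return the recognised keywords in order of appearance, skipping quoted names."""
    unquoted = re.sub(r'"[^"]*"', "", constraint)
    words = re.findall(r"[A-Za-z]+", unquoted)
    return [w for w in words if w in LEADING or w in TRAILING]

def follows_keyword_order(constraint):
    """Return True if the keyword ranks never decrease along the string."""
    ranks = [LEADING.get(k, 4) for k in keyword_sequence(constraint)]
    return ranks == sorted(ranks)

ordered = ('foreach ($"Material", $"Dest") define $"RemQty" + sum($"Moved") '
           '= -$"New(Demand)" minimum 0 cost $"RemCost"')
misordered = 'define $"RemQty" + sum($"Moved") = 0 foreach ($"Material")'

print(follows_keyword_order(ordered))
print(follows_keyword_order(misordered))
```

Removing the quoted names first prevents an attribute called, for example, "cost" from being mistaken for a keyword.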

Example

This example uses the Data and Rules datasets.

The Build/Solve task can produce not only a simple dataset containing the solution, but also auxiliary rows and the constraints matrix. This paragraph therefore covers both the example with the simple solution dataset and the example with the auxiliary rows and the constraints matrix.

Import the two required datasets to perform the analysis.
The first dataset contains a full description of the shipping, through attributes such as Priority, Dest, Source and so on. You can visualize the dataset by opening it through a Data Manager task or by right-clicking on the import task containing it and selecting Take a Look. To make working in the flow easier, we recommend renaming the task containing the dataset with the problem details, so that it is easily recognizable while configuring the Build/Solve task.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-dataset.webp

The second dataset is the configuration file, containing the variables’ definition and the constraints. In the first sheet, the variables have been defined. For example, we can find the following definitions:
- The Material, Source and Dest attributes have been defined as Key in the Role column.
- The Demand, MovingCost, RemCost, SourceCost and DestCost attributes have been defined so that the system checks that they are constant for the combinations written in the Distinct column.
- The attributes located from row 13 to row 21 are calculated during the analysis: their value is defined by the function indicated in the Formula field in the configuration file, and the moment when they are calculated is defined in the Application column. As with the first dataset, you can visualize the configuration file through a Data Manager task or with Take a Look, and we recommend renaming the task containing it so that it is easily recognizable while configuring the Build/Solve task.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-1.webp

The second sheet contains the constraints. For example:
- The Remaining demand at destination for a given material has been defined with the following constraint: foreach ($"Material", $"Dest") define $"RemQty" + sum($"Moved") = -$"New(Demand)" minimum 0 cost $"RemCost". For each $"Material" and $"Dest" pair, the auxiliary variable $"RemQty" plus the sum of the $"Moved" attribute must equal the right-hand side of the constraint, which is computed from the corresponding value of the $"New(Demand)" attribute. The minimum value for the auxiliary variable must be 0, and its unit cost is the $"RemCost" attribute.
- Note that the $"New(Demand)" attribute has been evaluated before the optimization started, so it is normal not to see it in the original shipping dataset.
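Read as an equation, the Remaining demand constraint can be written as shown below. The indices m (material), d (destination) and r (row), as well as the set R(m,d) of rows sharing the same Material and Dest pair, are notation introduced here for illustration; that sum ranges over exactly those rows is an interpretation of the foreach grouping, not something stated explicitly in the configuration file.

```latex
\mathrm{RemQty}_{m,d} + \sum_{r \in R(m,d)} \mathrm{Moved}_{r} = -\,\mathrm{New(Demand)}_{m,d},
\qquad \mathrm{RemQty}_{m,d} \ge 0,
\qquad \text{cost term: } \mathrm{RemCost}_{m,d} \cdot \mathrm{RemQty}_{m,d}
```

The cost term follows from the rule that, when a define statement is present, cost is the unit cost of the auxiliary variable.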

https://cdn.rulex.ai/docs/Factory/buildsolve-example-2.webp

Simple solution dataset

Add a Build/Solve task, and link both datasets to it. Configure the task as follows:
- Rule imported from task: rules (the name we gave to the task containing the configuration file)
- Data imported from task: data (the name we gave to the task containing the shipping data which make up the problem)
- Leave the other options as default.
- Save and compute the task.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-3.webp

Add a Data Manager task and link it to the Build/Solve task.
Then, double-click on the Data Manager to visualize the results.
The solution dataset has been updated with the new columns, filled with the values that are the result of the configuration file’s variables and constraints evaluation.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-4.webp

Auxiliary rows and constraints matrix

After having added a Build/Solve task, configure it as follows:
- Rule imported from task: rules (the name we gave to the task containing the configuration file)
- Data imported from task: data (the name we gave to the task containing the shipping data which make up the problem)
- Check the Add auxiliary rows and the Save constraints in cluster checkboxes.
- Save and compute the task.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-5.webp

To visualize the generated solution, along with its auxiliary rows, add a Data Manager task and link it to the Build/Solve task. The dataset has more rows than the solution dataset generated by leaving the configuration options untouched.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-6.webp

To visualize the constraints matrix, add a Convert Structure to Dataset task and link it to the Build/Solve task.
Open the Convert Structure to Dataset task and double check that Clusters has been selected in the Select the structure option.
Save and compute the task.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-7.webp

Add a Data Manager task and link it to the Convert Structure to Dataset task to visualize the constraints matrix.

https://cdn.rulex.ai/docs/Factory/buildsolve-example-8.webp