For more information, see the online learning platform
Fill missing values creates a variable, or a set of variables, in which missing values are replaced according to the chosen strategy. Missing values can be replaced by a default value, an average value, the previous or next value, or an interpolated value. It can be used for both numeric and symbolic variables.
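The strategies listed above can be illustrated outside the platform with a minimal pandas sketch. This is only an analogy, not the platform's implementation: the sample series, the column names and the default value of 0.0 are assumptions chosen for illustration.

```python
import pandas as pd

# Hypothetical numeric variable with two missing values.
s = pd.Series([1.0, None, 3.0, None, 7.0])

# One column per fill strategy, mirroring the idea of a variable set.
filled = pd.DataFrame({
    "original": s,
    "default": s.fillna(0.0),            # replace with a fixed default value (assumed 0.0)
    "average": s.fillna(s.mean()),       # replace with the mean of the known values
    "previous": s.ffill(),               # carry the previous known value forward
    "next": s.bfill(),                   # take the next known value
    "interpolated": s.interpolate(method="linear"),  # linear interpolation between neighbours
})

print(filled)
```

For a symbolic (text) variable, an average or an interpolation has no meaning, so in this sketch only the default, previous and next strategies would apply.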
If missing values (∅) were recorded in the Lake, they are preserved when the data is resampled. You can therefore choose a “Fill missing values” strategy to replace them.
To create a fill strategy for the missing values of a given variable:
Enter a name for the new variable set.
Note that Fill missing values creates a variable set containing several new variables. To check the filled variables, go to the third icon on the left bar.
The types used:
The method depends on the type of data to fill, and the logic for choosing it is similar to the logic used when resampling data.
You can also filter out missing values using “Record Sets”: if a value is missing for one variable, the whole record line is removed from the selection (that is, from the record set). Be aware that a rule that removes all rows where “Profit/hr” is below 3000 also removes any rows where “Profit/hr” is missing, and that this affects all variables, because record sets filter entire rows.
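The row-level effect of such a rule can be sketched with pandas. This is an analogy rather than the platform's own filtering logic: the `Temperature` column and all the values are hypothetical, and the threshold rule is written as a pandas comparison purely for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical records; "Temperature" is an invented second variable.
records = pd.DataFrame({
    "Profit/hr": [3500.0, 2500.0, np.nan, 4200.0],
    "Temperature": [80.1, 79.5, 81.0, np.nan],
})

# Keep only rows where "Profit/hr" is at least 3000.
# A comparison against a missing value is False, so the row with a
# missing "Profit/hr" is dropped together with the row below 3000.
record_set = records[records["Profit/hr"] >= 3000]

print(record_set)
```

In this sketch the third row is lost for every variable, including its valid `Temperature` reading, while the last row is kept even though its `Temperature` is missing, because the rule only tests “Profit/hr”.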