DATAmaestro Analytics is a web-based application for advanced analytics. It lets you perform all four types of analytics:

  • Descriptive analytics, to describe the past behaviour or situation of a system
  • Diagnostic analytics, to understand why that behaviour or situation occurred
  • Predictive analytics, to predict the outcome or behaviour of a system
  • Prescriptive analytics, to discover the decisions that make a system behave optimally

The analytics workflow is structured as a logical sequence of steps:

  1. Project is used to save all tasks built on the data
  2. Data is used to link one or more data sources to the project
  3. Select is used to define the subset of data to be analysed
  4. Transform is used to create new variables (columns) useful for further processing
  5. Visualize is used to explore and analyse data with visual tools and charts
  6. Model is used to apply machine learning models on selected data
  7. Report is used to extract results of the data analytics tasks

DATAmaestro is designed for end users with limited or no data science skills. Little or no coding is required to use it.

DATAmaestro Analytics Menu and Sidebar

In a DATAmaestro Analytics project, the main features are organized in a project workflow in the top menu. To access your saved project tasks, open the sidebar on the left.

The Analytics Menu

Items in the menu bar help you manage your projects through every stage.  


Menu items and their descriptions:

Project: Create, open, delete or copy a project. You can also set preferences to customize the page appearance.

Data: Manage the data sources that support your projects. You can upload CSV or Excel files, or use data connections like AllegroGraph or DATAserver.

Excel plugin: To upload Excel files, you must install a plugin; contact Technical Support.

Select: Create a new Record set or Variable set. For more information, see Data.

Transform: Use operators to create or edit Function Variable expressions, as well as process control features including CUSUM, Statistical Process Control (SPC), and Principal Component Analysis (PCA).

Visualize: Create histograms, scatter plots, box plots, trends and dendrograms based on the data you select.

Models: Use learning and test sets to build machine learning models. Train models on your data using robust classification, regression and clustering techniques.

Reports: Export subsets of your data to use for reporting purposes outside of DATAmaestro.

Help: Find answers to questions in the collection of user references and tutorials.

User Menu: Switch between Lake, Analytics and Dashboards, change your password, change language preferences, and more.


The Analytics Sidebar

In DATAmaestro Analytics, use the sidebar to access the elements that have been saved in your project. Elements are saved with their associated feature, which is represented by a grey icon. Hold your mouse pointer over an icon to see the name of the feature and the number of elements that are saved. For example, histograms: 


Histograms icon (active): The current project has nine saved histograms.


To expand or collapse all tasks, click All at the top of the sidebar. The expanded panel displays all the saved elements for each feature.


Sidebar Controls

Search: Use the search field at the top to find saved elements.

Pin: To pin the sidebar panel and view all of the current page, click the pin icon.

Open: To open an element, click the name in the sidebar.

New: To create a new element, click Add new... in the appropriate section.

Manage: To edit or delete an element, click the corresponding icon next to it.

Deleting and data dependency

If the element you want to delete is used by another task, the title of that task is displayed in the confirmation dialog box. If you delete the element, the dependent tasks are also removed. For example, if you confirm the deletion of a function variable that is used by a histogram, a scatter plot and a decision tree, all three are removed as well.
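The deletion rule above can be pictured as a walk over a "used by" mapping. The following is a minimal sketch only: the `used_by` dictionary and the `dependents_to_remove` function are hypothetical names for illustration and do not reflect how DATAmaestro stores projects. It also assumes that removal is transitive (a task that depends on a removed task is removed too), which the text above does not state explicitly.

```python
from collections import deque


def dependents_to_remove(element, used_by):
    """Return every task that would be removed together with `element`.

    `used_by` maps an element name to the names of the tasks that use it.
    This structure is an assumption made for illustration only.
    """
    removed = []
    seen = {element}
    queue = deque([element])
    while queue:
        current = queue.popleft()
        for task in used_by.get(current, []):
            if task not in seen:
                seen.add(task)
                removed.append(task)
                queue.append(task)  # assumed: removal cascades transitively
    return removed


# The example from the text: one function variable used by three tasks.
used_by = {
    "function variable": ["histogram", "scatter plot", "decision tree"],
}
print(dependents_to_remove("function variable", used_by))
```

Listing the dependent tasks before anything is deleted mirrors the confirmation dialog, which shows the affected task titles first.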


Sidebar Icons


Names and descriptions of the sidebar icons:

Data sources: Data source(s) uploaded to support the project. The number of files is indicated on the active icon.

Variable sets: Variable sets that have been created for the project. By default, all the individual variables appear in the Variable List.

Record sets: Record sets are created to establish specific record groups for a given data source. For example, records that correspond to a production regime, or records associated with the normal variation range of a KPI variable.

Function variables: Function variables are created to transform variables for models and advanced analysis.

Histograms: Histograms are typically used to illustrate the process distribution and to make predictions about a stable process.

Scatter plots: Scatter plots are typically used to show how much one variable is affected by another. Each row in the data table is represented by a marker whose position depends on its values in the columns set on the X- and Y-axes.

Box plots: Box plots provide effective views to help identify outliers. They show a data set's lowest value, highest value, median value, and the first and third quartiles. The box plot is useful for analyzing small data sets that do not lend themselves easily to histograms.

Curves: Curves are typically used to compare the temporal dynamic behaviour or trend of two or more numerical variables of a database.

Dendrograms: Dendrograms and Correlation matrices provide a means of assessing dependency levels between variables. A dendrogram is a tree-structured graph used in heat maps to visualize the result of a hierarchical clustering calculation.

Linear regressions: A simple linear regression is an approach to modeling the relationship between a scalar dependent variable Y and one explanatory variable X. Several input variables may be combined to predict an output variable Y in a multiple linear regression.

Trees: Decision trees and Regression trees are important modeling methods that are used to accomplish a prediction objective (classification or regression).

Ensemble trees: Ensemble trees are an extension of the regular tree methods: Extra Trees, AdaBoost and MART.

Clustering: K-Means and Subclu models in the project.

Other tasks: All other items can be found under the Other menu. These include models (Artificial Neural Networks, Partial Least Squares, k-Nearest Neighbors, Partial Dependence Plots, Sensitivity Analysis, Dynamic Inputs, PRIM analysis and Optimizer), statistical tests (T test, Spearman and Pearson), as well as Data Exports, Summary Charts, Process Flow and more.
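As an illustration of the values a box plot displays (lowest value, highest value, median, first and third quartiles), here is a minimal sketch using only the Python standard library. It is not DATAmaestro code: the function name `box_plot_summary` is hypothetical, the quartile method ("inclusive") is one of several conventions, and the 1.5 × IQR rule used to flag outliers is a common convention assumed here, not something the product documentation specifies.

```python
import statistics


def box_plot_summary(values):
    """Compute the five values a box plot shows, plus candidate outliers."""
    data = sorted(values)
    # Quartiles with the "inclusive" method (assumption: other tools may
    # use a different quartile convention and give slightly different values).
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    # Assumed outlier rule: points beyond 1.5 * IQR from the quartiles.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return {
        "min": data[0],
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": data[-1],
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


# A small data set with one unusually large value.
summary = box_plot_summary([1, 2, 3, 4, 5, 6, 7, 8, 30])
for name, value in summary.items():
    print(name, value)
```

With only nine values, a histogram would be hard to read, while this summary still makes the unusually large value stand out, which is the situation the box plot description refers to.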



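DATAmaestro Analytics is a web-based application dedicated to advanced analytics. It lets you perform all types of analytics:

  • Descriptive analytics, to describe the past behaviour or situation of a system
  • Diagnostic analytics, to understand why a system behaviour or situation occurred in the past
  • Predictive analytics, to predict the outcome or behaviour of a system
  • Prescriptive analytics, to discover the optimum decisions that make a system behave in an optimal way

The analytics workflow is structured in a logical way:

  1. Project is used to save all tasks built on the data.
  2. Data is used to link data sources to the project.
  3. Select is used to define the subset of data to be analysed.
  4. Analyse is used to explore the selected data with visual tools and charts.
  5. Transform is used to create new attributes (columns) useful for further processing.
  6. Model is used to apply machine learning models to the selected data.
  7. Report extracts the results of the data analytics tasks.

DATAmaestro is intended for end users who are not familiar with data science or statistics. No coding skills are required to use DATAmaestro.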