A Dataset is created from one or many Tables and is used to collect and transform data for use in a Workbook. Before you create a Dataset, you will need at least one active Table. If you are referencing more than one Table in your Dataset, make sure you have correctly established connections between those Tables using the Numetric Dictionary. For an overview of the purpose of Datasets, see the Getting Started article Assembling Datasets.
To create a new Dataset, follow the steps outlined below:
1. Navigate to the Datasets section of Numetric, here all existing Datasets will be displayed and grouped according to their categories. Clicking on the title of an existing Dataset will allow you to view and edit the Dataset.
To create a new Dataset, click the New Dataset button in the top right corner.
2. Begin by selecting the Primary Table for this Dataset. Click on the + Primary Table button under the Tables heading in the sidebar.
Select the Primary Table from the drop-down menu or search by typing the name of the Primary Table you would like to select.
Note - determining the appropriate Primary Table is an essential step in the process of building a Dataset. Selecting a Primary Table is discussed in more detail in the Assembling Datasets article.
3. Once you have selected your Primary Table, you can decide which columns from the Table you would like to bring into the Dataset. While you must include the column containing the Primary Key, you can toggle all other columns on or off (a checked box indicates that it will be included in the Dataset). By default, all columns are added to the Dataset. To check or uncheck all columns, toggle the X Columns Selected checkbox under the Column search bar.
To help with larger Tables, you can search all available columns using the Search bar located under the Columns heading in the center of the page.
A preview of the Dataset will populate below the column selection section and will update to reflect any added/removed columns you may select. To view a preview of the Dataset at any time, click the Dataset Preview button, located in the bottom left corner of the screen.
4. Once you have selected your Primary Table for the Dataset, you can add any Secondary Tables you would like to include in this Dataset. Click + Add a Joined Table to select which Tables to add.
Numetric will automatically display any Tables that have existing Dictionary Terms present in the Dataset. Note that each Secondary Table you select will indicate the type of join required to add that Table to your Dataset (either a one to one or one to many join).
If no Secondary Tables are available for you to select, it is most likely for one of two reasons: either you have already added all available Tables to your Dataset, or there are no Tables with Dictionary Terms that link to your Dataset.
5. If all of the Tables in your Dataset are one to one Joins, you will not need to setup any aggregations.
However, if any of your Secondary Tables require a one to many join you will need to select an aggregation method for the columns in the Table.
For more information on what an Aggregation is, or how to determine which Aggregation method is best for your situation, see the Dataset Aggregations article.
To add an Aggregation, click + Add an Aggregation. You will then need to select the table column name that will require aggregation, select the aggregation method, and select the column in which you would like the Aggregated data to display.
6. With your Tables added to your Dataset, you are ready to conduct any needed Transformations. Dataset Transformations function exactly the same as Table Transformations. The main difference with Dataset Transformations is they allow you to conduct transformations with data from various Tables.
For more information on the difference between Dataset and Table Transformations, see the Assembling Datasets Article. For a detailed step-by-step of the Transformation process, see the Transformations articles.
7. Once all of your Transformations have been completed, click the Column Properties tab at the top to preview the data found in that column, and finalize the order and properties of the columns in your Dataset.
Each column in the Dataset is displayed in the sidebar on the left. You can change their order by clicking and dragging, or by entering the desired order number next to the title.
8. You can adjust the properties of each column by clicking on the column name in the Columns Sidebar. This will display a summary of the data found in that column, as well as the properties of that specific column.
Here you can adjust the name of that column in your Dataset (Dataset Column Alias), or enter a brief description of the data contained in that column (Description).
9. At the top of the Column Properties screen, there are two toggle switches under the Behavior heading.
The first toggle switch, Visible, determines if the column will be visible in the Dataset. Often, columns will be present in a dataset, and used for transformations or calculations, but you do not wish for that column to be visible in the finished Workbook. By toggling this option off (with the switch on the left, and colored grey), the column of data will still be present in the Dataset, but will not be visible.
The second toggle switch, Searchable, will determine if the values in this column of data will be searchable by users in the finished Workbook. Making text searchable is helpful for applying filters in Workbooks, however, you should be careful not to mark all data as searchable. Too much searchable data will make for a poor user experience when applying a filter in a Workbook.
10. With your Column Properties set, you are ready to finalize your Dataset. Click on the Dataset Properties tab at the top of the page to display the name and category of the Dataset, as well as define who can view or edit the Dataset.
To change the name of the Dataset, click the field located under the Dataset Name heading. Additionally, you can adjust the category of the Dataset by selecting the desired category from the drop-down menu.
The Members sidebar displays all of the current users with access to this specific Dataset. Here you can grant users or groups access to either view or edit this Dataset. It is important to note that because Datasets are used to provide data for a Workbook, anyone who has access to view a Workbook will also be automatically granted access to view any associated Datasets.
Under the Stats heading you can also see valuable information about this Dataset, including the size, number of rows, and the last time the Dataset was updated.
11. Once your Dataset is finalized and ready, click the Publish button at the top to create, and index the Dataset.
Clicking Publish will queue up your Dataset for indexing. This process may take several hours, to several days. Indexing all of your data prior to the creation of a Workbook allows Numetric to support lightning-fast reporting. With this indexing approach you can search, filter, adjust, and create new Metrics in Workbooks without having to re-index, or update your database.
If you wish to undo your changes, click the Revert button in the top corner. This will undo any changes made to the Dataset, and revert it back to the previous state.