Data calendar

The Data calendar is a very crucial part for your project because it is where you select the datasets that you want to work with. It also allows you to see a dataset’s data distribution over time and you can set time-related parameters for your project such as the time range, the frame duration and the preview frame (more on that later).

At the initial stage right after you created a new project, you have to select at least one dataset from the data calendar before moving on to the next step, operations.

It is always recommended to use the data calendar page to change your time-related settings because you will be able to visually see the impact of the new settings, for example whether you have data available for the settings or not (you can change those settings in other places in the tool as well).

Choosing your dataset

Choosing a dataset from the available list

available_datasetAs soon as you created a new project, you will be redirected to the data calendar page and since you have no dataset selected yet, the available dataset section will be opened automatically for you (you can do that manually by clicking Available Datasets menu on the bottom-left corner).

In the Available Datasets section, you will see all the datasets that you have access to. This includes your own datasets, public ones and ones that you were granted access to specifically.

In the list, you can click the dataset name to see the detail page of that dataset (see below for more information). You can read the information of that dataset and then click the Add button to start working with that dataset.

Once the dataset was added, that dataset will be removed from Available Datasets section and will be inserted to Selected Datasets section (you can only add one particular dataset once to a project).

You can add more than one dataset for your project, with a limit of 5 datasets per project.

 

Dataset description

If you click the dataset name in either the Selected Datasets or Available Datasets sections, a dataset detail window will be opened. It contains a lot of information regarding that particular dataset for example the availability over time, description, available fields, and other useful information.

dataset_detail

 

Removing a dataset once selected

To remove a selected dataset, you have to open the dataset detail page by clicking the dataset name in the Available Datasets section and click the Remove button.

Currently, you can only do this if you don’t have any operations attached to the dataset you intend to remove. Once you have an operator applied to your dataset, you have to go to Operations page first and remove the operator node that is attached to the dataset first in order to delete the dataset (you might have to remove multiple operators if order to be able to delete the operator directly attached to the dataset).

Selecting the project time range

Project time range is the overall range of data that you want to look at, e.g. maybe you’re only interested in taxi trips during the week of Christmas Day. To do this you would set the time range to that particular week. All taxi trips that do not fall into the set time range will be excluded.

selecting-project-time-range

The bar chart representing the distribution of records of a dataset over time. You can move the view in time by dragging the canvas. You can also zoom the view either by using your mouse wheel or by selecting pre-set zoom levels from the dropdown in the top right corner.

To modify your project time range, you can resize or move the blue rectangle in the center. If you drag the the left / right edge, you can resize the rectangle, and if you drag the center of the rectangle, you will be able to move it while keeping the time range the same.

Frame duration

The frame duration is the duration interval in which project time-range will be divided with. Using frame duration, you can define how much time each frame of the visualization covers. If you set this parameter to one hour for example, all the data during one hour will be represented in the visualization in one frame.

Another example is in the screenshot below where the project time-range is from 11 April 2011 18:00 until 19:00, and the frame duration is 15 mins. Based on these settings, the project will have 4 frames like this:

  1. 11 April 2011 18:00 – 18:15
  2. 11 April 2014 18:15 – 18:30
  3. 11 April 2014 18:00 – 18:45
  4. 11 April 2014 18:45 – 19:00

data_calendar

You can select any of these 4 frames as your preview frame. And then later on the visualisation page, you can create a visualization that animates over these 4 frames accordingly after you processed all the data in project time range.

Below is a list with all currently available frame duration settings:

  • 5 mins
  • 10 mins
  • 15 mins
  • 30 mins
  • 1 hour
  • 2 hours
  • 4 hours
  • 8 hours
  • 1 day
  • 7 days
  • 15 days
  • 30 days
  • 180 days
  • 365 days

An important thing to note is that if you change the frame duration, all the processing results of your operators that were previously computed are invalid (since you changed the frame duration which is the way the data is aggregated over time). The operators can be re-executed after a change to produce the new, correct result (the re-execution happens automatically after you open the operator configuration window after the change).

Preview frame

The preview frame is the selected frame which will be used in the project to provide previews of how operators affect your data. Additionally you can use the data from the preview frame to create an initial visualization of your data (after that you would typically process all frames so that you can animate your visualization over time, please see the visualization section to learn more).

You can change the active preview frame by clicking the top part of the slices within your project time range. The active frame will be highlighted differently compared to the rest of the frames. In the screenshot below, you can see that the active preview frame is the third slice with the greyish colour (18:30 – 18:45).

time-range-preview-frame

An important thing to note is that if you change the preview frame, all the processing results of your operators that were previously computed are invalid (since they were calculated for the preview frame that was set before). The operators can be re-executed after a change to produce the new, correct result (the re-execution happens automatically after you open the operator configuration window after the change).

Zoom in your data

zoom

The data calendar histogram view is zoomable which allows you to explore datasets both over very large time frames (years) and very small ones (minutes). To zoom-in or zoom-out, you can use your mouse wheel. Also, you can zoom to a specific zoom range relevant to your project by using the zoom dropdown, located at the top right corner.

The zoom shortcut has 3 targets:

  1. All data
  2. Project time range
  3. Preview frame

For example, let’s assume you selected a dataset that has data from 1st January 2011 until 1st May 2011. And further, let’s assume that you set the time range to 10th April 2011 00:00 until 15th April 2011 00:00 with a frame duration of 2 hours (preview frame: 12th April 2011 18:00 until 12 January 2011 20:00). Based on these settings the zoom shortcuts would take you to:

  1. All data: 1st January 2011 00:00 – 1st May 2011 00:00
  2. Project time range: 10th April 2011 00:00 – 15th April 2011 00:00
  3. Preview frame: 12th April 18:00 – 12th April 2011 20:00

Project summary

green_bar

The green bar on top is giving you summary of important informations regarding your current project. You can change the title of your workflow simply double clicking on it; you can easily see the Project Time, the Frame Duration and at the end the you know at which Preview Frame are you looking in that moment. You could also change your project timezone setting from this section.


Still unclear with the guides? Write to us your questions or head over to our FAQs section.