ContinuousTimeSegmentation Configuration
To initialise and run the component two configs are used - general_config.ini
and time_segments.ini
. In general_config.ini
to execute the component specify all paths to its four corresponding data objects (input + output). Example:
[Paths.Silver]
event_data_silver_flagged = ${Paths:silver_dir}/mno_events_flagged
cell_intersection_groups_data_silver = ${Paths:silver_dir}/cell_intersection_groups
time_segments_silver = ${Paths:silver_dir}/time_segments
In time_segments.ini parameters are as follows:
-
data_period_start - string, format should be “yyyy-MM-dd“ (e.g. 2023-01-01), the date from which start Event Cleaning
-
data_period_end - string, format should be “yyyy-MM-dd“ (e.g. 2023-01-05), the date till which perform Event Cleaning
-
is_first_run - boolean, if True, the component won't use previously calculated time segments. If False, the component will use last calculated time segment per device.
-
event_error_flags_to_include - list of integers, the list of error flags that should be included in the time segments processing. Default value is [0], so only events with no errors are included.
-
min_time_stay_s - integer, the minimum dwell time in seconds for a time segments to be considered as a "stay". Default value is 15 minutes.
-
max_time_missing_stay_s - integer, maximum time difference between events to be considered a “stay”. If larger, the time segment will be marked “unknown”. Default value is 12 hours to support devices being offline at home or work addresses.
-
max_time_missing_move_s - integer, maximum time difference between events to be considered a “move”. If larger, the time segment will be marked “unknown”. Default value is 2 hours.
-
pad_time_s - integer, half the size of an isolated time segment: between two “unknowns” time segments. It expands the isolated event in time, by “padding” from the “unknown” time segments on both sides. Default value is 5 minutes.
Configuration example
[ContinuousTimeSegmentation]
data_period_start = 2023-01-01
data_period_end = 2023-01-05
is_first_run = true
event_error_flags_to_include = [0]
min_time_stay_s = 900
max_time_missing_stay_s = 43200
max_time_missing_move_s = 7200
pad_time_s = 300