Formatting Your Data
The Streams experience is a powerful way to view and analyze almost any type of data, but it has limitations. To get the most out of Streams, format your data as described below before uploading it.
Geographic Information
GeoJSON
The file must follow the standard GeoJSON format.
A feature can have multiple properties, but nested (encapsulated) properties with multiple levels are not supported for now. Nested properties are automatically removed from the source data. Of the two examples below, the first is kept as-is, while encapsulated_property in the second is removed.
{
"type": "Feature",
"geometry": {
"type": "Point",
"coordinates": [125.6, 10.1]
},
"properties": {
"name": "Dinagat Islands"
}
}
{
"type": "Feature",
"geometry": {
"type": "Point",
"coordinates": [125.6, 10.1]
},
"properties": {
"name": "Dinagat Islands",
"encapsulated_property": {
"property": "value"
}
}
}
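If you want to keep nested values rather than have them removed, one option is to flatten them before uploading. The following is a minimal sketch in Python, not part of Streams itself: the function name flatten_properties and the underscore separator are assumptions chosen for illustration, and lists inside properties are left untouched.

```python
def flatten_properties(feature, sep="_"):
    """Return a copy of a GeoJSON Feature with nested properties flattened.

    Hypothetical helper: nested keys are joined with `sep`, so
    {"a": {"b": 1}} becomes {"a_b": 1}.
    """
    flat = {}

    def walk(prefix, value):
        if isinstance(value, dict):
            # Recurse into nested objects, extending the key prefix.
            for key, inner in value.items():
                walk(f"{prefix}{sep}{key}" if prefix else key, inner)
        else:
            flat[prefix] = value

    walk("", feature.get("properties") or {})
    # Keep type and geometry unchanged; replace only the properties.
    return {**feature, "properties": flat}


feature = {
    "type": "Feature",
    "geometry": {"type": "Point", "coordinates": [125.6, 10.1]},
    "properties": {
        "name": "Dinagat Islands",
        "encapsulated_property": {"property": "value"},
    },
}
flattened = flatten_properties(feature)
```

With the second example above as input, the nested value ends up under the single-level key encapsulated_property_property, so it is no longer a nested property.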
CSV
The file must contain geometry information, which can be provided in one of four ways:
- A column named geometry containing the feature's geometry in GeoJSON format
"geometry": {
"type": "Point",
"coordinates": [125.6, 10.1]
}
- Two columns named latitude and longitude, from which Point features are built automatically
- One column named h3_token containing the H3 token, following the H3 convention
- Four columns named as shown below, from which a LineString feature is built automatically. Warning: each line is drawn as a straight segment between the two points, regardless of the shape of the underlying street.
latitude_<something_1>
longitude_<something_1>
latitude_<something_2>
longitude_<something_2>
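The four column layouts above can be checked before upload with a small script. This is a sketch only: the function names, the order in which layouts are tested, and the choice to take the two LineString points in column order are all assumptions, not documented Streams behavior. The one fixed convention used here is that GeoJSON coordinates are written as [longitude, latitude].

```python
def paired_suffixes(columns):
    """Suffixes that appear in both a latitude_<x> and a longitude_<x> column."""
    cols = list(columns)
    present = set(cols)
    prefix = "latitude_"
    return [
        c[len(prefix):]
        for c in cols
        if c.startswith(prefix)
        and len(c) > len(prefix)
        and "longitude_" + c[len(prefix):] in present
    ]


def detect_geometry_layout(columns):
    """Return which geometry layout a CSV header matches, or None.

    The precedence between layouts is an assumption for illustration.
    """
    present = set(columns)
    if "geometry" in present:
        return "geometry"
    if {"latitude", "longitude"} <= present:
        return "point"
    if "h3_token" in present:
        return "h3"
    if len(paired_suffixes(columns)) == 2:
        return "linestring"
    return None


def row_to_linestring(row):
    """Build a straight two-point LineString from a row with four lat/lon columns."""
    first, second = paired_suffixes(row.keys())
    return {
        "type": "LineString",
        # GeoJSON positions are [longitude, latitude].
        "coordinates": [
            [float(row[f"longitude_{s}"]), float(row[f"latitude_{s}"])]
            for s in (first, second)
        ],
    }
```

For example, a header of latitude_start, longitude_start, latitude_end, longitude_end is reported as "linestring", while a header with none of the expected columns returns None, which signals that the file lacks usable geometry information.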
Time data
To be viewed as a Stream, the data must contain a time column with data points covering at least two days.
The datetime column must use one of these formats: 2022-05-04 or 2022-05-04T22:00:00
If the file contains multiple datetime columns, the loading script uses the first one it finds, so include only one datetime column in your upload file.
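The time requirements above can be verified locally before uploading. The sketch below uses only the Python standard library; the function names are hypothetical, and "covering at least two days" is interpreted here as "containing at least two distinct calendar dates", which is an assumption about how Streams applies the rule.

```python
from datetime import datetime

# The two formats shown above: date only, or date and time separated by "T".
ACCEPTED_FORMATS = ("%Y-%m-%d", "%Y-%m-%dT%H:%M:%S")


def parse_timestamp(text):
    """Parse a value in one of the accepted formats, or raise ValueError."""
    for fmt in ACCEPTED_FORMATS:
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    raise ValueError(f"unsupported datetime format: {text!r}")


def covers_two_days(values):
    """True if the values span at least two distinct calendar dates (assumed rule)."""
    days = {parse_timestamp(v).date() for v in values}
    return len(days) >= 2
```

Running covers_two_days over the values of your time column raises ValueError on the first value that does not match either format, and otherwise reports whether the column spans enough days to be viewed as a Stream under the assumption stated above.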