Document

dask 2021.10.0

timeseries(start='2000-01-01', end='2000-01-31', freq='1s', partition_freq='1d', dtypes={'name': <class 'str'>, 'id': <class 'int'>, 'x': <class 'float'>, 'y': <class 'float'>}, seed=None, **kwargs)

Parameters

start : datetime (or datetime-like string): Start of time series
end : datetime (or datetime-like string): End of time series
dtypes : dict: Mapping of column names to types. Valid types include {float, int, str, 'category'}
freq : string: String like '2s' or '1H' or '12W' for the time series frequency
partition_freq : string: String like '1M' or '2Y' to divide the dataframe into partitions
seed : int (optional): Randomstate seed
kwargs: :: Keywords to pass down to individual column creation functions. Keywords should be prefixed by the column name and then an underscore.

Create timeseries dataframe with random data

Examples

This example is valid syntax, but we were not able to check execution

>>> import dask
... df = dask.datasets.timeseries()
... df.head()  # doctest: +SKIP
          timestamp    id     name         x         y
2000-01-01 00:00:00   967    Jerry -0.031348 -0.040633
2000-01-01 00:00:01  1066  Michael -0.262136  0.307107
2000-01-01 00:00:02   988    Wendy -0.526331  0.128641
2000-01-01 00:00:03  1016   Yvonne  0.620456  0.767270
2000-01-01 00:00:04   998   Ursula  0.684902 -0.463278

This example is valid syntax, but we were not able to check execution

>>> df = dask.datasets.timeseries(
...     '2000', '2010',
...     freq='2H', partition_freq='1D', seed=1,  # data frequency
...     dtypes={'value': float, 'name': str, 'id': int},  # data types
...     id_lam=1000  # control number of items in id column
... )

See :

Back References

The following pages refer to to this document either explicitly or contain code examples using this.

dask.datasets.timeseries

Local connectivity graph

Hover to see nodes names; edges to Self not shown, Caped at 50 nodes.

Using a canvas is more power efficient and can get hundred of nodes ; but does not allow hyperlinks; , arrows or text (beyond on hover)

SVG is more flexible but power hungry; and does not scale well to 50 + nodes.

All aboves nodes referred to, (or are referred from) current nodes; Edges from Self to other have been omitted (or all nodes would be connected to the central node "self" which is not useful). Nodes are colored by the library they belong to, and scaled with the number of references pointing them

File: /dask/datasets.py#6
type: <class 'function'>
Commit: