dataframe
DataFrame(data)
Bases: OfflineEnvironment
A dataset environment.
This environment represents static tabular datasets.
Attributes:
Name | Type | Description |
---|---|---|
data |
LazyFrame
|
The data to represent. |
Initialize the dataset environment.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
data
|
DataFrame | LazyFrame
|
The data to represent. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
29 30 31 32 33 34 35 36 37 38 39 40 |
|
from_csv(path, separator=',')
classmethod
Load a dataset from a CSV file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path
|
str | Path
|
Path to the CSV file. |
required |
separator
|
str
|
Value separator. Defaults to ",". |
','
|
Source code in src/flowcean/polars/environments/dataframe.py
42 43 44 45 46 47 48 49 50 51 52 |
|
from_json(path)
classmethod
Load a dataset from a JSON file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path
|
str | Path
|
Path to the JSON file. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
54 55 56 57 58 59 60 61 62 |
|
from_parquet(path)
classmethod
Load a dataset from a Parquet file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path
|
str | Path
|
Path to the Parquet file. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
64 65 66 67 68 69 70 71 72 |
|
from_yaml(path)
classmethod
Load a dataset from a YAML file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path
|
str | Path
|
Path to the YAML file. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
74 75 76 77 78 79 80 81 82 |
|
from_uri(uri)
classmethod
Load a dataset from a URI.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
uri
|
str
|
The URI to load the dataset from. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 |
|
__len__()
Return the number of samples in the dataset.
Source code in src/flowcean/polars/environments/dataframe.py
109 110 111 112 113 114 115 116 117 |
|
InvalidUriSchemeError(scheme)
Bases: Exception
Exception raised when an URI scheme is invalid.
Initialize the InvalidUriSchemeError.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
scheme
|
str
|
Invalid URI scheme. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
130 131 132 133 134 135 136 137 138 |
|
UnsupportedFileTypeError(suffix)
Bases: Exception
Exception raised when a file type is not supported.
Initialize the UnsupportedFileTypeError.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
suffix
|
str
|
File type suffix. |
required |
Source code in src/flowcean/polars/environments/dataframe.py
144 145 146 147 148 149 150 |
|
collect(environment, n=None, *, progress_bar=True)
Collect data from an environment.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
environment
|
Iterable[LazyFrame] | Collection[LazyFrame]
|
The environment to collect data from. |
required |
n
|
int | None
|
Number of samples to collect. If None, all samples are collected. |
None
|
progress_bar
|
bool | dict[str, Any]
|
Whether to show a progress bar. If a dictionary is provided, it will be passed to the progress bar. |
True
|
Returns:
Type | Description |
---|---|
DataFrame
|
The collected dataset. |
Source code in src/flowcean/polars/environments/dataframe.py
153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 |
|