
class towhee.data_loader.DataLoader(data_source: Iterable | Callable, parser: Callable | None = None, batch_size: int | None = None)

Bases: object

  • data_source (Union[Iterable, Callable]) – The source to read data from; either an iterable or a callable.

  • parser (Callable) – Function applied to each item read from data_source to convert it into input that the pipeline can process.

  • batch_size (int) – If specified, group the read data into batches of batch_size items; otherwise, yield items one at a time.
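
The interplay of the three parameters can be illustrated with a small stand-in generator. This is only a sketch of the behavior described above, not towhee's implementation: the name iter_loader_sketch is hypothetical, and it assumes that a callable data_source is invoked with no arguments and returns an iterable, and that a final batch smaller than batch_size is still yielded.

```python
from collections.abc import Iterable


def iter_loader_sketch(data_source, parser=None, batch_size=None):
    """Hypothetical stand-in mimicking the documented parameters."""
    # Assumption: a callable source is called with no arguments
    # and returns something iterable.
    if callable(data_source):
        items = data_source()
    elif isinstance(data_source, Iterable):
        items = data_source
    else:
        raise TypeError("data_source must be an iterable or a callable")

    # Without a parser, items pass through unchanged.
    parse = parser if parser is not None else (lambda item: item)

    if batch_size is None:
        # No batching: yield parsed items one at a time.
        for item in items:
            yield parse(item)
        return

    batch = []
    for item in items:
        batch.append(parse(item))
        if len(batch) == batch_size:
            yield batch
            batch = []
    # Assumption: a trailing partial batch is yielded rather than dropped.
    if batch:
        yield batch


records = [{'num': 1}, {'num': 2}, {'num': 3}]
get_num = lambda record: record['num']

single = list(iter_loader_sketch(records, parser=get_num))
batched = list(iter_loader_sketch(records, parser=get_num, batch_size=2))
print(single)   # [1, 2, 3]
print(batched)  # [[1, 2], [3]]
```

Under these assumptions, omitting batch_size gives one parsed item per iteration, while batch_size=2 gives lists of up to two parsed items.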


>>> from towhee import DataLoader, pipe, ops
>>> p = pipe.input('num').map('num', 'ret', lambda x: x + 1).output('ret')
>>> for data in DataLoader([{'num': 1}, {'num': 2}, {'num': 3}], parser=lambda x: x['num']):
...     print(p(data).to_list(kv_format=True))
[{'ret': 2}]
[{'ret': 3}]
[{'ret': 4}]
