Class ReadRowsIterable (2.13.1)

ReadRowsIterable(reader, read_session=None)

An iterable of rows from a read session.

Parameters

NameDescription
reader google.cloud.bigquery_storage_v1.reader.ReadRowsStream

A read rows stream.

read_session Optional[ReadSession]

DEPRECATED. This argument was used to specify the schema of the rows in the stream, but now the first message in a read stream contains this information.

Inheritance

builtins.object > ReadRowsIterable

Properties

pages

A generator of all pages in the stream.

Returns
TypeDescription
types.GeneratorType[google.cloud.bigquery_storage_v1.ReadRowsPage]A generator of pages.

Methods

__iter__

__iter__()

Iterator for each row in all pages.

to_arrow

to_arrow()

Create a pyarrow.Table of all rows in the stream.

This method requires the pyarrow library and a stream using the Arrow format.

Returns
TypeDescription
pyarrow.TableA table of all rows in the stream.

to_dataframe

to_dataframe(dtypes=None)

Create a pandas.DataFrame of all rows in the stream.

This method requires the pandas libary to create a data frame and the fastavro library to parse row messages.

.. warning:: DATETIME columns are not supported. They are currently parsed as strings in the fastavro library.

Parameter
NameDescription
dtypes Map[str, Union[str, pandas.Series.dtype]]

Optional. A dictionary of column names pandas dtypes. The provided dtype is used when constructing the series for the column specified. Otherwise, the default pandas behavior is used.

Returns
TypeDescription
pandas.DataFrameA data frame of all rows in the stream.