tlc.core.objects.tables.from_url.table_from_parquet

A table populated from a Parquet file loaded from a URL.

Module Contents

Classes

Class               Description
TableFromParquet    A table populated from a Parquet file loaded from a URL.

API

class tlc.core.objects.tables.from_url.table_from_parquet.TableFromParquet(url: tlc.core.url.Url | None = None, created: str | None = None, description: str | None = None, row_cache_url: tlc.core.url.Url | None = None, row_cache_populated: bool | None = None, override_table_rows_schema: Any = None, init_parameters: Any = None, input_url: tlc.core.url.Url | None = None, input_tables: list[tlc.core.url.Url] | None = None)

Bases: tlc.core.objects.tables.in_memory_columns_table._InMemoryColumnsTable

A table populated from a Parquet file loaded from a URL.

Parameters:
  • url – The URL of the table.

  • created – The creation time of the table.

  • description – The description of the table.

  • row_cache_url – The URL of the row cache.

  • row_cache_populated – Whether the row cache is populated.

  • override_table_rows_schema – The schema to override the table rows schema.

  • init_parameters – The initial parameters of the table.

  • input_url – The URL of the Parquet file to load.

  • input_tables – A list of Table URLs that are considered direct predecessors in this table’s lineage. This parameter serves as an explicit mechanism for tracking table relationships beyond the automatic lineage tracing typically managed by subclasses.
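As a rough illustration of how the constructor parameters above fit together, the sketch below wraps the call in a small helper. The helper name make_parquet_table and both location strings are hypothetical. It is also an assumption here that Url accepts a plain string and that input_url identifies the source Parquet file. The import paths are the ones shown in the signature.

```python
def make_parquet_table(parquet_location: str, table_location: str):
    """Hypothetical helper: build a TableFromParquet from two location strings."""
    # Imported lazily so the sketch can be read without 3LC installed.
    from tlc.core.url import Url
    from tlc.core.objects.tables.from_url.table_from_parquet import TableFromParquet

    return TableFromParquet(
        url=Url(table_location),          # where the table object itself lives
        input_url=Url(parquet_location),  # assumed: the source Parquet file
        description="Rows loaded from a Parquet file",
    )
```

All other parameters are left at their default of None, as the signature allows.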

is_all_parquet() → bool

This table is all Parquet.

get_rows_as_binary(exclude_bulk_data: bool = False) → bytes

Return the table rows as binary data.

For TableFromParquet, if the data is produced without any transformation, the input file can be passed through as-is. Otherwise, the normal binary data production pipeline is used.
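The pass-through behaviour described above can be sketched in plain Python. This is not the tlc implementation: rows_as_binary_sketch, the is_untransformed flag, and the serialize_rows callback are hypothetical names that only illustrate the decision between returning the original file bytes and re-serializing the rows.

```python
from typing import Callable


def rows_as_binary_sketch(
    source_bytes: bytes,
    is_untransformed: bool,
    serialize_rows: Callable[[], bytes],
) -> bytes:
    """Hypothetical helper, not part of tlc."""
    if is_untransformed:
        # Fast path: the stored input file already is the binary payload.
        return source_bytes
    # Fallback: build the binary payload from the (transformed) rows.
    return serialize_rows()


original = b"PAR1...file contents...PAR1"  # stand-in for real Parquet bytes
calls = []


def fake_serializer() -> bytes:
    # Records each invocation so the two paths can be told apart.
    calls.append("serialized")
    return b"re-encoded rows"


fast = rows_as_binary_sketch(original, True, fake_serializer)
slow = rows_as_binary_sketch(original, False, fake_serializer)
```

On the fast path the serializer is never invoked, which is the point of handing back the input file unchanged.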