3LC Python Package Version 2.14¶
2.14.1¶
Features¶
[15098] Added an unauthenticated “/live” endpoint to the object service that can be used to check whether the object service is live
Enhancements and Fixes¶
[15029] Fixed issue with handling expiration of JWT from dashboard
[51090] Modified logic during indexing to avoid checking whether a URL is a directory when the URL itself is sufficient to provide the information needed. This also avoids an issue when using GCS in some scenarios.
[15102] Made a fix so that
TableWriterproperly updates the timestamp of theTablebeing written[15110] Provide additional logging and response details when Object Service request authentication fails due to time skew when using a license key in a 3LC Enterprise Customer Managed deployment
2.14.0¶
Features¶
[14955] Added timestamp HMAC authentication to Object Service request handling when using a license key in a 3LC Enterprise Customer Managed deployment. This adds an additional layer of security that ensures that the Object Service only handles requests from trusted Dashboard instances. Note that this requires the Object Service and Dashboard Service to be configured with a shared authentication secret. See the documentation on Secure Communication for details.
[14960] Added support for segmentation in
TableFromCoco[12915, 14916] Added
Tabledelete_column,delete_columns,delete_row,delete_rows[14974] Implemented
Schema.__getitem__as a shortcut to allow for e.g.schema["bbs"]["bb_list"]["label"]instead ofschema.values["bbs"].values["bb_list"].values["label"]
Enhancements and Fixes¶
[14973] Override
TableFromParquetget_row_cache_sizeto fallback to input parquet size[14963] Made
Schema.sample_typedefault toNoneto clearly distinguish between inheritance and explicit overrides, including the empty string""[14972] Made it so that
SampleType.from_structurereturns aCategoricalLabelwhen the schema includes a map[14964] Made
FloatVector2andFloatVector3builtin schemas consistentExtracted
Urland related classes into a newtlcurlmodule while preserving symbol backwards compatibility. This allows for useUrlrelated types outside of the full indexing machinery, such as for configuration and logging scenarios.[14790] Enhanced logging capabilities and configuration to make logging consistent across
3lcmodules[12351] For
EmbeddingsMetricsCollector, include flatten strategy in embeddings column name, which allows for collecting embeddings on the same layer but with a different flatten strategy, which would previously not work because name was only unique per layer and not per flatten strategy[14186] Made TableWriter a context manager and called
finalize()on exit[13485, 14547] Catch
ArrowTypeErrorswhen converting batches topyarrow.RecordBatchto allow for providing a clearer error message when passing data that does not match the schema to aTableWriter[14546] Handle numpy numbers in
SampleType.from_sample[14843] Added validation to user-provided names for tables, datasets, projects, columns, and map elements to avoid names that would later cause issues for a variety of reasons, e.g. because they are illegal as file / directory names
[14157] Made it so that
MeanAggregatorignoresNaN,inf,-infandNonein its mean computation[13458] Allow numpy types in
SampleTypeFloatandIntensure_sample_valid[14713] Provide a clearer error message when
Table.join_tablefails due to inconsistent schemas[14857] Made it so that metrics collected by the Hugging Face Trainer have prefixes for the split they are for (“train” or “eval”)
[14616] Give targeted error message when
torch.Tensorornp.ndarraypassed as structure toSampleType.from_structure