tiledbsoma.io.to_anndata¶
- tiledbsoma.io.to_anndata(experiment: ~tiledbsoma._experiment.Experiment, measurement_name: str, *, X_layer_name: str | ~tiledbsoma._util.Sentinel | None = <tiledbsoma._util.Sentinel object>, extra_X_layer_names: ~typing.Sequence[str] | ~typing.KeysView[str] | None = None, obs_id_name: str | None = None, var_id_name: str | None = None, obsm_varm_width_hints: dict[str, dict[str, int]] | None = None, uns_keys: ~typing.Sequence[str] | None = None) AnnData ¶
Converts the experiment group to AnnData format.
The choice of matrix formats is following what we often see in input
.h5ad
files:X
asscipy.sparse.csr_matrix
obs
,var
aspandas.dataframe
obsm
,varm
arrays asnumpy.ndarray
obsp
,varp
arrays asscipy.sparse.csr_matrix
The
X_layer_name
is the name of the TileDB-SOMA measurement’sX
collection which will be outgested to the resulting AnnData object’sadata.X
. IfX_layer_name
is unspecified, and the Measurement contains an X layer named “data”, it will be returned. IfX_layer_name
isNone
, then the return value’sadata.X
will be None, andadata.layers
will be unpopulated. IfX_layer_name
is a string, thenadata.X
will be taken from this layer name within the input measurement, and it will be an error if the measurement’sX
does not contain that layer name.The
extra_X_layer_names
are used to specify how the outputadata
object’sadata.layers
is populated. The default behavior –extra_X_layer_names
beingNone
– means thatadata.layers
will be empty. Ifextra_X_layer_names
is a provided list these will be used for populatingadata.layers
. If you want all the layers to be outgested, without having to name them individually, you can useextra_X_layer_names=experiment.ms[measurement_name].X.keys()
. To make this low-friction for you, we introduce one more feature: we’ll ignoreX_layer_name
when populatingadata.layers
. For example, if X keys are"a"
,"b"
,"c"
,"d"
, and you sayX_layer_name="b"
andextra_X_layer_names=experiment.ms[measurement_name].X.keys()
, we’ll not write"b"
toadata.layers
.The
obs_id_name
andvar_id_name
are columns within the TileDB-SOMA experiment which will become index names within the resulting AnnData object’sobs
/var
dataframes. If not specified as arguments, the TileDB-SOMA’s dataframes will be checked for an original-index-name key. When that also is unavailable, these default to"obs_id"
and"var_id"
, respectively.The
obsm_varm_width_hints
is optional. If provided, it should be of the form{"obsm":{"X_tSNE":2}}
to aid with export errors.If
uns_keys
is provided, only the specified top-leveluns
keys are extracted. The default is to extract them all. Useuns_keys=[]
to not outgest anyuns
keys.Lifecycle
Maturing.