bioimageio.core
bioimageio.core
Python specific core utilities for bioimage.io resources (in particular DL models).
Get started
To get started we recommend installing bioimageio.core with conda together with a deep
learning framework, e.g. pytorch, and run a few bioimageio
commands to see what
bioimage.core has to offer:
install with conda (for more details on conda environments, checkout the conda docs)
conda install -c conda-forge bioimageio.core pytorch
test a model
$ bioimageio test powerful-chipmunk ...
(Click to expand output)
✔️ bioimageio validation passed ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ source https://uk1s3.embassy.ebi.ac.uk/public-datasets/bioimage.io/powerful-chipmunk/1/files/rdf.yaml format version model 0.4.10 bioimageio.spec 0.5.3post4 bioimageio.core 0.6.8 ❓ location detail ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✔️ initialized ModelDescr to describe model 0.4.10 ✔️ bioimageio.spec format validation model 0.4.10 🔍 context.perform_io_checks True 🔍 context.root https://uk1s3.embassy.ebi.ac.uk/public-datasets/bioimage.io/powerful-chipmunk/1/files 🔍 context.known_files.weights.pt 3bd9c518c8473f1e35abb7624f82f3aa92f1015e66fb1f6a9d08444e1f2f5698 🔍 context.known_files.weights-torchscript.pt 4e568fd81c0ffa06ce13061327c3f673e1bac808891135badd3b0fcdacee086b 🔍 context.warning_level error ✔️ Reproduce test outputs from test inputs ✔️ Reproduce test outputs from test inputs
or
$ bioimageio test impartial-shrimp ...
(Click to expand output)
✔️ bioimageio validation passed ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ source https://uk1s3.embassy.ebi.ac.uk/public-datasets/bioimage.io/impartial-shrimp/1.1/files/rdf.yaml format version model 0.5.3 bioimageio.spec 0.5.3.2 bioimageio.core 0.6.9 ❓ location detail ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✔️ initialized ModelDescr to describe model 0.5.3 ✔️ bioimageio.spec format validation model 0.5.3 🔍 context.perform_io_checks False 🔍 context.warning_level error ✔️ Reproduce test outputs from test inputs (pytorch_state_dict) ✔️ Run pytorch_state_dict inference for inputs with batch_size: 1 and size parameter n: 0 ✔️ Run pytorch_state_dict inference for inputs with batch_size: 2 and size parameter n: 0 ✔️ Run pytorch_state_dict inference for inputs with batch_size: 1 and size parameter n: 1 ✔️ Run pytorch_state_dict inference for inputs with batch_size: 2 and size parameter n: 1 ✔️ Run pytorch_state_dict inference for inputs with batch_size: 1 and size parameter n: 2 ✔️ Run pytorch_state_dict inference for inputs with batch_size: 2 and size parameter n: 2 ✔️ Reproduce test outputs from test inputs (torchscript) ✔️ Run torchscript inference for inputs with batch_size: 1 and size parameter n: 0 ✔️ Run torchscript inference for inputs with batch_size: 2 and size parameter n: 0 ✔️ Run torchscript inference for inputs with batch_size: 1 and size parameter n: 1 ✔️ Run torchscript inference for inputs with batch_size: 2 and size parameter n: 1 ✔️ Run torchscript inference for inputs with batch_size: 1 and size parameter n: 2 ✔️ Run torchscript inference for inputs with batch_size: 2 and size parameter n: 2
- run prediction on your data
display the
bioimageio-predict
command help to get an overview:$ bioimageio predict --help ...
(Click to expand output)
usage: bioimageio predict [-h] [--inputs Sequence[Union[str,Annotated[Tuple[str,...],MinLenmin_length=1]]]] [--outputs {str,Tuple[str,...]}] [--overwrite bool] [--blockwise bool] [--stats Path] [--preview bool] [--weight_format {typing.Literal['keras_hdf5','onnx','pytorch_state_dict','tensorflow_js','tensorflow_saved_model_bundle','torchscript'],any}] [--example bool] SOURCE bioimageio-predict - Run inference on your data with a bioimage.io model. positional arguments: SOURCE Url/path to a bioimageio.yaml/rdf.yaml file or a bioimage.io resource identifier, e.g. 'affable-shark' optional arguments: -h, --help show this help message and exit --inputs Sequence[Union[str,Annotated[Tuple[str,...],MinLen(min_length=1)]]] Model input sample paths (for each input tensor) The input paths are expected to have shape... - (n_samples,) or (n_samples,1) for models expecting a single input tensor - (n_samples,) containing the substring '{input_id}', or - (n_samples, n_model_inputs) to provide each input tensor path explicitly. All substrings that are replaced by metadata from the model description: - '{model_id}' - '{input_id}' Example inputs to process sample 'a' and 'b' for a model expecting a 'raw' and a 'mask' input tensor: --inputs="[["a_raw.tif","a_mask.tif"],["b_raw.tif","b_mask.tif"]]" (Note that JSON double quotes need to be escaped.) Alternatively a `bioimageio-cli.yaml` (or `bioimageio-cli.json`) file may provide the arguments, e.g.: ```yaml inputs: - [a_raw.tif, a_mask.tif] - [b_raw.tif, b_mask.tif]
`.npy` and any file extension supported by imageio are supported. Aavailable formats are listed at https://imageio.readthedocs.io/en/stable/formats/index.html#all-formats. Some formats have additional dependencies. (default: ('{input_id}/001.tif',))
--outputs {str,Tuple[str,...]} Model output path pattern (per output tensor)
All substrings that are replaced: - '{model_id}' (from model description) - '{output_id}' (from model description) - '{sample_id}' (extracted from input paths) (default: outputs_{model_id}/{output_id}/{sample_id}.tif)
--overwrite bool allow overwriting existing output files (default: False) --blockwise bool process inputs blockwise (default: False) --stats Path path to dataset statistics (will be written if it does not exist, but the model requires statistical dataset measures) (default: dataset_statistics.json) --preview bool preview which files would be processed and what outputs would be generated. (default: False) --weight_format {typing.Literal['keras_hdf5','onnx','pytorch_state_dict','tensorflow_js','tensorflow_saved_model_bundle','torchscript'],any} The weight format to use. (default: any) --example bool generate and run an example
1. downloads example model inputs 2. creates a `{model_id}_example` folder 3. writes input arguments to `{model_id}_example/bioimageio-cli.yaml` 4. executes a preview dry-run 5. executes prediction with example input (default: False)
</details>
- create an example and run prediction locally!
```console
$ bioimageio predict impartial-shrimp --example=True
...
<details>
<summary>(Click to expand output)</summary>
🛈 bioimageio prediction preview structure:
{'{sample_id}': {'inputs': {'{input_id}': '<input path>'},
'outputs': {'{output_id}': '<output path>'}}}
🔎 bioimageio prediction preview output:
{'1': {'inputs': {'input0': 'impartial-shrimp_example/input0/001.tif'},
'outputs': {'output0': 'impartial-shrimp_example/outputs/output0/1.tif'}}}
predict with impartial-shrimp: 100%|███████████████████████████████████████████████████| 1/1 [00:21<00:00, 21.76s/sample]
🎉 Sucessfully ran example prediction!
To predict the example input using the CLI example config file impartial-shrimp_example\bioimageio-cli.yaml, execute `bioimageio predict` from impartial-shrimp_example:
$ cd impartial-shrimp_example
$ bioimageio predict "impartial-shrimp"
Alternatively run the following command in the current workind directory, not the example folder:
$ bioimageio predict --preview=False --overwrite=True --stats="impartial-shrimp_example/dataset_statistics.json" --inputs="[[\"impartial-shrimp_example/input0/001.tif\"]]" --outputs="impartial-shrimp_example/outputs/{output_id}/{sample_id}.tif" "impartial-shrimp"
(note that a local 'bioimageio-cli.json' or 'bioimageio-cli.yaml' may interfere with this)
</details>
Installation
Via Conda
The bioimageio.core
package can be installed from conda-forge via
conda install -c conda-forge bioimageio.core
If you do not install any additional deep learning libraries, you will only be able to use general convenience
functionality, but not any functionality depending on model prediction.
To install additional deep learning libraries add pytorch
, onnxruntime
, keras
or tensorflow
.
Deeplearning frameworks to consider installing alongside bioimageio.core
:
Via pip
The package is also available via pip
(e.g. with recommended extras onnx
and pytorch
):
pip install "bioimageio.core[onnx,pytorch]"
🐍 Use in Python
bioimageio.core
is a python package that implements prediction with bioimageio models
including standardized pre- and postprocessing operations.
These models are described by---and can be loaded with---the bioimageio.spec package.
In addition bioimageio.core provides functionality to convert model weight formats.
Documentation
Here you find the bioimageio.core documentation.
Presentations
Examples
- Notebooks that save and load resource descriptions and validate their format (using bioimageio.spec, a dependency of bioimageio.core)
- load_model_and_create_your_own.ipynb
- dataset_creation.ipynb
- Use the described resources in Python with bioimageio.core
- model_usage.ipynb
💻 Use the Command Line Interface
bioimageio.core
installs a command line interface (CLI) for testing models and other functionality.
You can list all the available commands via:
bioimageio
For examples see Get started.
CLI inputs from file
For convenience the command line options (not arguments) may be given in a bioimageio-cli.json
or bioimageio-cli.yaml
file, e.g.:
# bioimageio-cli.yaml
inputs: inputs/*_{tensor_id}.h5
outputs: outputs_{model_id}/{sample_id}_{tensor_id}.h5
overwrite: true
blockwise: true
stats: inputs/dataset_statistics.json
Set up Development Environment
To set up a development conda environment run the following commands:
conda env create -f dev/env.yaml
conda activate core
pip install -e . --no-deps
There are different environment files available that only install tensorflow or pytorch as dependencies, see dev folder.
Logging level
bioimageio.spec
and bioimageio.core
use loguru for logging, hence the logging level
may be controlled with the LOGURU_LEVEL
environment variable.
Changelog
next release
- fixes:
- CLI
- improved handling of summary argument to not create a path with brackets when given a list of paths.
- improved backward compatibility when runnig tests for models specifying an older bioimageio.core version in their environment.
This is relevant when using
runtime_env="as-described"
. It works by simply trying option--summary
(new option name) and--summary-path
(outdated option name).
- CLI
0.9.0
- update to bioimageio.spec 0.5.4.3
0.8.0
- breaking: removed
decimals
argument from bioimageio CLI andbioimageio.core.commands.test()
- New feature:
bioimageio.core.test_description
accepts runtime_env and run_command to test a resource using the conda environment described by that resource (or another specified conda env) - new CLI command:
bioimageio add-weights
(and utility function: bioimageio.core.add_weights) - removed
bioimageio.core.proc_ops.get_proc_class
in favor ofbioimageio.core.proc_ops.get_proc
- new CLI command:
bioimageio update-format
- new CLI command:
bioimageio update-hashes
0.7.0
- breaking:
- bioimageio CLI now has implicit boolean flags
- non-breaking:
- use new
ValidationDetail.recommended_env
inValidationSummary
- improve
get_io_sample_block_metas()
- now works for sufficiently large, but not exactly shaped inputs
- update to support
zipfile.ZipFile
object with bioimageio.spec==0.5.3.5 - add io helpers
resolve
andresolve_and_extract
- added
enable_determinism
function and determinism input argument for testing with seeded random generators and optionally (determinsim=="full") instructing DL frameworks to use deterministic algorithms.
- use new
0.6.10
- fix #423
0.6.9
- improve bioimageio command line interface (details in #157)
- add
predict
command - package command input
path
is now required
- add
0.6.8
- testing model inference will now check all weight formats (previously only the first one for which model adapter creation succeeded had been checked)
- fix predict with blocking (Thanks @thodkatz)
0.6.7
predict()
argumentinputs
may be sample
0.6.6
- add aliases to match previous API more closely
0.6.5
- improve adapter error messages
0.6.4
- add
bioimageio validate-format
command - improve error messages and display of command results
0.6.3
- Fix #386
- (in model inference testing) stop assuming model inputs are tileable
0.6.2
- Fix #384
0.6.1
0.6.0
- add compatibility with new bioimageio.spec 0.5 (0.5.2post1)
- improve interfaces
0.5.10
1""" 2.. include:: ../../README.md 3""" 4 5from bioimageio.spec import ( 6 ValidationSummary, 7 build_description, 8 dump_description, 9 load_dataset_description, 10 load_description, 11 load_description_and_validate_format_only, 12 load_model_description, 13 save_bioimageio_package, 14 save_bioimageio_package_as_folder, 15 save_bioimageio_yaml_only, 16 validate_format, 17) 18 19from . import ( 20 axis, 21 block_meta, 22 cli, 23 commands, 24 common, 25 digest_spec, 26 io, 27 model_adapters, 28 prediction, 29 proc_ops, 30 proc_setup, 31 sample, 32 stat_calculators, 33 stat_measures, 34 tensor, 35) 36from ._prediction_pipeline import PredictionPipeline, create_prediction_pipeline 37from ._resource_tests import ( 38 enable_determinism, 39 load_description_and_test, 40 test_description, 41 test_model, 42) 43from ._settings import settings 44from .axis import Axis, AxisId 45from .backends import create_model_adapter 46from .block_meta import BlockMeta 47from .common import MemberId 48from .prediction import predict, predict_many 49from .sample import Sample 50from .stat_calculators import compute_dataset_measures 51from .stat_measures import Stat 52from .tensor import Tensor 53from .utils import VERSION 54from .weight_converters import add_weights 55 56__version__ = VERSION 57 58 59# aliases 60test_resource = test_description 61"""alias of `test_description`""" 62load_resource = load_description 63"""alias of `load_description`""" 64load_model = load_model_description 65"""alias of `load_model_description`""" 66 67__all__ = [ 68 "__version__", 69 "add_weights", 70 "axis", 71 "Axis", 72 "AxisId", 73 "block_meta", 74 "BlockMeta", 75 "build_description", 76 "cli", 77 "commands", 78 "common", 79 "compute_dataset_measures", 80 "create_model_adapter", 81 "create_prediction_pipeline", 82 "digest_spec", 83 "dump_description", 84 "enable_determinism", 85 "io", 86 "load_dataset_description", 87 "load_description_and_test", 88 "load_description_and_validate_format_only", 89 "load_description", 90 "load_model_description", 91 "load_model", 92 "load_resource", 93 "MemberId", 94 "model_adapters", 95 "predict_many", 96 "predict", 97 "prediction", 98 "PredictionPipeline", 99 "proc_ops", 100 "proc_setup", 101 "sample", 102 "Sample", 103 "save_bioimageio_package_as_folder", 104 "save_bioimageio_package", 105 "save_bioimageio_yaml_only", 106 "settings", 107 "stat_calculators", 108 "stat_measures", 109 "Stat", 110 "tensor", 111 "Tensor", 112 "test_description", 113 "test_model", 114 "test_resource", 115 "validate_format", 116 "ValidationSummary", 117]
18def add_weights( 19 model_descr: ModelDescr, 20 *, 21 output_path: DirectoryPath, 22 source_format: Optional[WeightsFormat] = None, 23 target_format: Optional[WeightsFormat] = None, 24 verbose: bool = False, 25) -> Optional[ModelDescr]: 26 """Convert model weights to other formats and add them to the model description 27 28 Args: 29 output_path: Path to save updated model package to. 30 source_format: convert from a specific weights format. 31 Default: choose automatically from any available. 32 target_format: convert to a specific weights format. 33 Default: attempt to convert to any missing format. 34 devices: Devices that may be used during conversion. 35 verbose: log more (error) output 36 37 Returns: 38 - An updated model description if any converted weights were added. 39 - `None` if no conversion was possible. 40 """ 41 if not isinstance(model_descr, ModelDescr): 42 if model_descr.type == "model" and not isinstance(model_descr, InvalidDescr): 43 raise TypeError( 44 f"Model format {model_descr.format} is not supported, please update" 45 + f" model to format {ModelDescr.implemented_format_version} first." 46 ) 47 48 raise TypeError(type(model_descr)) 49 50 # save model to local folder 51 output_path = save_bioimageio_package_as_folder( 52 model_descr, output_path=output_path 53 ) 54 # reload from local folder to make sure we do not edit the given model 55 _model_descr = load_model_description(output_path, perform_io_checks=False) 56 assert isinstance(_model_descr, ModelDescr) 57 model_descr = _model_descr 58 del _model_descr 59 60 if source_format is None: 61 available = set(model_descr.weights.available_formats) 62 else: 63 available = {source_format} 64 65 if target_format is None: 66 missing = set(model_descr.weights.missing_formats) 67 else: 68 missing = {target_format} 69 70 originally_missing = set(missing) 71 72 if "pytorch_state_dict" in available and "torchscript" in missing: 73 logger.info( 74 "Attempting to convert 'pytorch_state_dict' weights to 'torchscript'." 75 ) 76 from .pytorch_to_torchscript import convert 77 78 try: 79 torchscript_weights_path = output_path / "weights_torchscript.pt" 80 model_descr.weights.torchscript = convert( 81 model_descr, 82 output_path=torchscript_weights_path, 83 use_tracing=False, 84 ) 85 except Exception as e: 86 if verbose: 87 traceback.print_exception(e) 88 89 logger.error(e) 90 else: 91 available.add("torchscript") 92 missing.discard("torchscript") 93 94 if "pytorch_state_dict" in available and "torchscript" in missing: 95 logger.info( 96 "Attempting to convert 'pytorch_state_dict' weights to 'torchscript' by tracing." 97 ) 98 from .pytorch_to_torchscript import convert 99 100 try: 101 torchscript_weights_path = output_path / "weights_torchscript_traced.pt" 102 103 model_descr.weights.torchscript = convert( 104 model_descr, 105 output_path=torchscript_weights_path, 106 use_tracing=True, 107 ) 108 except Exception as e: 109 if verbose: 110 traceback.print_exception(e) 111 112 logger.error(e) 113 else: 114 available.add("torchscript") 115 missing.discard("torchscript") 116 117 if "torchscript" in available and "onnx" in missing: 118 logger.info("Attempting to convert 'torchscript' weights to 'onnx'.") 119 from .torchscript_to_onnx import convert 120 121 try: 122 onnx_weights_path = output_path / "weights.onnx" 123 model_descr.weights.onnx = convert( 124 model_descr, 125 output_path=onnx_weights_path, 126 ) 127 except Exception as e: 128 if verbose: 129 traceback.print_exception(e) 130 131 logger.error(e) 132 else: 133 available.add("onnx") 134 missing.discard("onnx") 135 136 if "pytorch_state_dict" in available and "onnx" in missing: 137 logger.info("Attempting to convert 'pytorch_state_dict' weights to 'onnx'.") 138 from .pytorch_to_onnx import convert 139 140 try: 141 onnx_weights_path = output_path / "weights.onnx" 142 143 model_descr.weights.onnx = convert( 144 model_descr, 145 output_path=onnx_weights_path, 146 verbose=verbose, 147 ) 148 except Exception as e: 149 if verbose: 150 traceback.print_exception(e) 151 152 logger.error(e) 153 else: 154 available.add("onnx") 155 missing.discard("onnx") 156 157 if missing: 158 logger.warning( 159 f"Converting from any of the available weights formats {available} to any" 160 + f" of {missing} failed or is not yet implemented. Please create an issue" 161 + " at https://github.com/bioimage-io/core-bioimage-io-python/issues/new/choose" 162 + " if you would like bioimageio.core to support a particular conversion." 163 ) 164 165 if originally_missing == missing: 166 logger.warning("failed to add any converted weights") 167 return None 168 else: 169 logger.info("added weights formats {}", originally_missing - missing) 170 # resave model with updated rdf.yaml 171 _ = save_bioimageio_package_as_folder(model_descr, output_path=output_path) 172 tested_model_descr = load_description_and_test(model_descr) 173 assert isinstance(tested_model_descr, ModelDescr) 174 return tested_model_descr
Convert model weights to other formats and add them to the model description
Arguments:
- output_path: Path to save updated model package to.
- source_format: convert from a specific weights format. Default: choose automatically from any available.
- target_format: convert to a specific weights format. Default: attempt to convert to any missing format.
- devices: Devices that may be used during conversion.
- verbose: log more (error) output
Returns:
- An updated model description if any converted weights were added.
None
if no conversion was possible.
49@dataclass 50class Axis: 51 id: AxisId 52 type: Literal["batch", "channel", "index", "space", "time"] 53 54 def __post_init__(self): 55 if self.type == "batch": 56 self.id = AxisId("batch") 57 elif self.type == "channel": 58 self.id = AxisId("channel") 59 60 @classmethod 61 def create(cls, axis: AxisLike) -> Axis: 62 if isinstance(axis, cls): 63 return axis 64 elif isinstance(axis, Axis): 65 return Axis(id=axis.id, type=axis.type) 66 elif isinstance(axis, v0_5.AxisBase): 67 return Axis(id=AxisId(axis.id), type=axis.type) 68 elif isinstance(axis, str): 69 return Axis(id=AxisId(axis), type=_guess_axis_type(axis)) 70 else: 71 assert_never(axis)
60 @classmethod 61 def create(cls, axis: AxisLike) -> Axis: 62 if isinstance(axis, cls): 63 return axis 64 elif isinstance(axis, Axis): 65 return Axis(id=axis.id, type=axis.type) 66 elif isinstance(axis, v0_5.AxisBase): 67 return Axis(id=AxisId(axis.id), type=axis.type) 68 elif isinstance(axis, str): 69 return Axis(id=AxisId(axis), type=_guess_axis_type(axis)) 70 else: 71 assert_never(axis)
242class AxisId(LowerCaseIdentifier): 243 root_model: ClassVar[Type[RootModel[Any]]] = RootModel[ 244 Annotated[ 245 LowerCaseIdentifierAnno, 246 MaxLen(16), 247 AfterValidator(_normalize_axis_id), 248 ] 249 ]
str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str
Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.__str__() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.
46@dataclass(frozen=True) 47class BlockMeta: 48 """Block meta data of a sample member (a tensor in a sample) 49 50 Figure for illustration: 51 The first 2d block (dashed) of a sample member (**bold**). 52 The inner slice (thin) is expanded by a halo in both dimensions on both sides. 53 The outer slice reaches from the sample member origin (0, 0) to the right halo point. 54 55 ```terminal 56 ┌ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ┐ 57 ╷ halo(left) ╷ 58 ╷ ╷ 59 ╷ (0, 0)┏━━━━━━━━━━━━━━━━━┯━━━━━━━━━┯━━━➔ 60 ╷ ┃ │ ╷ sample member 61 ╷ ┃ inner │ ╷ 62 ╷ ┃ (and outer) │ outer ╷ 63 ╷ ┃ slice │ slice ╷ 64 ╷ ┃ │ ╷ 65 ╷ ┣─────────────────┘ ╷ 66 ╷ ┃ outer slice ╷ 67 ╷ ┃ halo(right) ╷ 68 └ ─ ─ ─ ─┃─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─┘ 69 ⬇ 70 ``` 71 72 note: 73 - Inner and outer slices are specified in sample member coordinates. 74 - The outer_slice of a block at the sample edge may overlap by more than the 75 halo with the neighboring block (the inner slices will not overlap though). 76 77 """ 78 79 sample_shape: PerAxis[int] 80 """the axis sizes of the whole (unblocked) sample""" 81 82 inner_slice: PerAxis[SliceInfo] 83 """inner region (without halo) wrt the sample""" 84 85 halo: PerAxis[Halo] 86 """halo enlarging the inner region to the block's sizes""" 87 88 block_index: BlockIndex 89 """the i-th block of the sample""" 90 91 blocks_in_sample: TotalNumberOfBlocks 92 """total number of blocks in the sample""" 93 94 @cached_property 95 def shape(self) -> PerAxis[int]: 96 """axis lengths of the block""" 97 return Frozen( 98 { 99 a: s.stop - s.start + (sum(self.halo[a]) if a in self.halo else 0) 100 for a, s in self.inner_slice.items() 101 } 102 ) 103 104 @cached_property 105 def padding(self) -> PerAxis[PadWidth]: 106 """padding to realize the halo at the sample edge 107 where we cannot simply enlarge the inner slice""" 108 return Frozen( 109 { 110 a: PadWidth( 111 ( 112 self.halo[a].left 113 - (self.inner_slice[a].start - self.outer_slice[a].start) 114 if a in self.halo 115 else 0 116 ), 117 ( 118 self.halo[a].right 119 - (self.outer_slice[a].stop - self.inner_slice[a].stop) 120 if a in self.halo 121 else 0 122 ), 123 ) 124 for a in self.inner_slice 125 } 126 ) 127 128 @cached_property 129 def outer_slice(self) -> PerAxis[SliceInfo]: 130 """slice of the outer block (without padding) wrt the sample""" 131 return Frozen( 132 { 133 a: SliceInfo( 134 max( 135 0, 136 min( 137 self.inner_slice[a].start 138 - (self.halo[a].left if a in self.halo else 0), 139 self.sample_shape[a] 140 - self.inner_shape[a] 141 - (self.halo[a].left if a in self.halo else 0), 142 ), 143 ), 144 min( 145 self.sample_shape[a], 146 self.inner_slice[a].stop 147 + (self.halo[a].right if a in self.halo else 0), 148 ), 149 ) 150 for a in self.inner_slice 151 } 152 ) 153 154 @cached_property 155 def inner_shape(self) -> PerAxis[int]: 156 """axis lengths of the inner region (without halo)""" 157 return Frozen({a: s.stop - s.start for a, s in self.inner_slice.items()}) 158 159 @cached_property 160 def local_slice(self) -> PerAxis[SliceInfo]: 161 """inner slice wrt the block, **not** the sample""" 162 return Frozen( 163 { 164 a: SliceInfo( 165 self.halo[a].left, 166 self.halo[a].left + self.inner_shape[a], 167 ) 168 for a in self.inner_slice 169 } 170 ) 171 172 @property 173 def dims(self) -> Collection[AxisId]: 174 return set(self.inner_shape) 175 176 @property 177 def tagged_shape(self) -> PerAxis[int]: 178 """alias for shape""" 179 return self.shape 180 181 @property 182 def inner_slice_wo_overlap(self): 183 """subslice of the inner slice, such that all `inner_slice_wo_overlap` can be 184 stiched together trivially to form the original sample. 185 186 This can also be used to calculate statistics 187 without overrepresenting block edge regions.""" 188 # TODO: update inner_slice_wo_overlap when adding block overlap 189 return self.inner_slice 190 191 def __post_init__(self): 192 # freeze mutable inputs 193 if not isinstance(self.sample_shape, Frozen): 194 object.__setattr__(self, "sample_shape", Frozen(self.sample_shape)) 195 196 if not isinstance(self.inner_slice, Frozen): 197 object.__setattr__(self, "inner_slice", Frozen(self.inner_slice)) 198 199 if not isinstance(self.halo, Frozen): 200 object.__setattr__(self, "halo", Frozen(self.halo)) 201 202 assert all( 203 a in self.sample_shape for a in self.inner_slice 204 ), "block has axes not present in sample" 205 206 assert all( 207 a in self.inner_slice for a in self.halo 208 ), "halo has axes not present in block" 209 210 if any(s > self.sample_shape[a] for a, s in self.shape.items()): 211 logger.warning( 212 "block {} larger than sample {}", self.shape, self.sample_shape 213 ) 214 215 def get_transformed( 216 self, new_axes: PerAxis[Union[LinearAxisTransform, int]] 217 ) -> Self: 218 return self.__class__( 219 sample_shape={ 220 a: ( 221 trf 222 if isinstance(trf, int) 223 else trf.compute(self.sample_shape[trf.axis]) 224 ) 225 for a, trf in new_axes.items() 226 }, 227 inner_slice={ 228 a: ( 229 SliceInfo(0, trf) 230 if isinstance(trf, int) 231 else SliceInfo( 232 trf.compute(self.inner_slice[trf.axis].start), 233 trf.compute(self.inner_slice[trf.axis].stop), 234 ) 235 ) 236 for a, trf in new_axes.items() 237 }, 238 halo={ 239 a: ( 240 Halo(0, 0) 241 if isinstance(trf, int) 242 else Halo(self.halo[trf.axis].left, self.halo[trf.axis].right) 243 ) 244 for a, trf in new_axes.items() 245 }, 246 block_index=self.block_index, 247 blocks_in_sample=self.blocks_in_sample, 248 )
Block meta data of a sample member (a tensor in a sample)
Figure for illustration: The first 2d block (dashed) of a sample member (bold). The inner slice (thin) is expanded by a halo in both dimensions on both sides. The outer slice reaches from the sample member origin (0, 0) to the right halo point.
┌ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ┐
╷ halo(left) ╷
╷ ╷
╷ (0, 0)┏━━━━━━━━━━━━━━━━━┯━━━━━━━━━┯━━━➔
╷ ┃ │ ╷ sample member
╷ ┃ inner │ ╷
╷ ┃ (and outer) │ outer ╷
╷ ┃ slice │ slice ╷
╷ ┃ │ ╷
╷ ┣─────────────────┘ ╷
╷ ┃ outer slice ╷
╷ ┃ halo(right) ╷
└ ─ ─ ─ ─┃─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─┘
⬇
note:
- Inner and outer slices are specified in sample member coordinates.
- The outer_slice of a block at the sample edge may overlap by more than the halo with the neighboring block (the inner slices will not overlap though).
inner region (without halo) wrt the sample
halo enlarging the inner region to the block's sizes
94 @cached_property 95 def shape(self) -> PerAxis[int]: 96 """axis lengths of the block""" 97 return Frozen( 98 { 99 a: s.stop - s.start + (sum(self.halo[a]) if a in self.halo else 0) 100 for a, s in self.inner_slice.items() 101 } 102 )
axis lengths of the block
104 @cached_property 105 def padding(self) -> PerAxis[PadWidth]: 106 """padding to realize the halo at the sample edge 107 where we cannot simply enlarge the inner slice""" 108 return Frozen( 109 { 110 a: PadWidth( 111 ( 112 self.halo[a].left 113 - (self.inner_slice[a].start - self.outer_slice[a].start) 114 if a in self.halo 115 else 0 116 ), 117 ( 118 self.halo[a].right 119 - (self.outer_slice[a].stop - self.inner_slice[a].stop) 120 if a in self.halo 121 else 0 122 ), 123 ) 124 for a in self.inner_slice 125 } 126 )
padding to realize the halo at the sample edge where we cannot simply enlarge the inner slice
128 @cached_property 129 def outer_slice(self) -> PerAxis[SliceInfo]: 130 """slice of the outer block (without padding) wrt the sample""" 131 return Frozen( 132 { 133 a: SliceInfo( 134 max( 135 0, 136 min( 137 self.inner_slice[a].start 138 - (self.halo[a].left if a in self.halo else 0), 139 self.sample_shape[a] 140 - self.inner_shape[a] 141 - (self.halo[a].left if a in self.halo else 0), 142 ), 143 ), 144 min( 145 self.sample_shape[a], 146 self.inner_slice[a].stop 147 + (self.halo[a].right if a in self.halo else 0), 148 ), 149 ) 150 for a in self.inner_slice 151 } 152 )
slice of the outer block (without padding) wrt the sample
154 @cached_property 155 def inner_shape(self) -> PerAxis[int]: 156 """axis lengths of the inner region (without halo)""" 157 return Frozen({a: s.stop - s.start for a, s in self.inner_slice.items()})
axis lengths of the inner region (without halo)
159 @cached_property 160 def local_slice(self) -> PerAxis[SliceInfo]: 161 """inner slice wrt the block, **not** the sample""" 162 return Frozen( 163 { 164 a: SliceInfo( 165 self.halo[a].left, 166 self.halo[a].left + self.inner_shape[a], 167 ) 168 for a in self.inner_slice 169 } 170 )
inner slice wrt the block, not the sample
176 @property 177 def tagged_shape(self) -> PerAxis[int]: 178 """alias for shape""" 179 return self.shape
alias for shape
181 @property 182 def inner_slice_wo_overlap(self): 183 """subslice of the inner slice, such that all `inner_slice_wo_overlap` can be 184 stiched together trivially to form the original sample. 185 186 This can also be used to calculate statistics 187 without overrepresenting block edge regions.""" 188 # TODO: update inner_slice_wo_overlap when adding block overlap 189 return self.inner_slice
subslice of the inner slice, such that all inner_slice_wo_overlap
can be
stiched together trivially to form the original sample.
This can also be used to calculate statistics without overrepresenting block edge regions.
215 def get_transformed( 216 self, new_axes: PerAxis[Union[LinearAxisTransform, int]] 217 ) -> Self: 218 return self.__class__( 219 sample_shape={ 220 a: ( 221 trf 222 if isinstance(trf, int) 223 else trf.compute(self.sample_shape[trf.axis]) 224 ) 225 for a, trf in new_axes.items() 226 }, 227 inner_slice={ 228 a: ( 229 SliceInfo(0, trf) 230 if isinstance(trf, int) 231 else SliceInfo( 232 trf.compute(self.inner_slice[trf.axis].start), 233 trf.compute(self.inner_slice[trf.axis].stop), 234 ) 235 ) 236 for a, trf in new_axes.items() 237 }, 238 halo={ 239 a: ( 240 Halo(0, 0) 241 if isinstance(trf, int) 242 else Halo(self.halo[trf.axis].left, self.halo[trf.axis].right) 243 ) 244 for a, trf in new_axes.items() 245 }, 246 block_index=self.block_index, 247 blocks_in_sample=self.blocks_in_sample, 248 )
173def build_description( 174 content: BioimageioYamlContentView, 175 /, 176 *, 177 context: Optional[ValidationContext] = None, 178 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 179) -> Union[ResourceDescr, InvalidDescr]: 180 """build a bioimage.io resource description from an RDF's content. 181 182 Use `load_description` if you want to build a resource description from an rdf.yaml 183 or bioimage.io zip-package. 184 185 Args: 186 content: loaded rdf.yaml file (loaded with YAML, not bioimageio.spec) 187 context: validation context to use during validation 188 format_version: (optional) use this argument to load the resource and 189 convert its metadata to a higher format_version 190 191 Returns: 192 An object holding all metadata of the bioimage.io resource 193 194 """ 195 196 return build_description_impl( 197 content, 198 context=context, 199 format_version=format_version, 200 get_rd_class=_get_rd_class, 201 )
build a bioimage.io resource description from an RDF's content.
Use load_description
if you want to build a resource description from an rdf.yaml
or bioimage.io zip-package.
Arguments:
- content: loaded rdf.yaml file (loaded with YAML, not bioimageio.spec)
- context: validation context to use during validation
- format_version: (optional) use this argument to load the resource and convert its metadata to a higher format_version
Returns:
An object holding all metadata of the bioimage.io resource
578def compute_dataset_measures( 579 measures: Iterable[DatasetMeasure], dataset: Iterable[Sample] 580) -> Dict[DatasetMeasure, MeasureValue]: 581 """compute all dataset `measures` for the given `dataset`""" 582 sample_calculators, calculators = get_measure_calculators(measures) 583 assert not sample_calculators 584 585 ret: Dict[DatasetMeasure, MeasureValue] = {} 586 587 for sample in dataset: 588 for calc in calculators: 589 calc.update(sample) 590 591 for calc in calculators: 592 ret.update(calc.finalize().items()) 593 594 return ret
compute all dataset measures
for the given dataset
72 @final 73 @classmethod 74 def create( 75 cls, 76 model_description: Union[v0_4.ModelDescr, v0_5.ModelDescr], 77 *, 78 devices: Optional[Sequence[str]] = None, 79 weight_format_priority_order: Optional[Sequence[SupportedWeightsFormat]] = None, 80 ): 81 """ 82 Creates model adapter based on the passed spec 83 Note: All specific adapters should happen inside this function to prevent different framework 84 initializations interfering with each other 85 """ 86 if not isinstance(model_description, (v0_4.ModelDescr, v0_5.ModelDescr)): 87 raise TypeError( 88 f"expected v0_4.ModelDescr or v0_5.ModelDescr, but got {type(model_description)}" 89 ) 90 91 weights = model_description.weights 92 errors: List[Exception] = [] 93 weight_format_priority_order = ( 94 DEFAULT_WEIGHT_FORMAT_PRIORITY_ORDER 95 if weight_format_priority_order is None 96 else weight_format_priority_order 97 ) 98 # limit weight formats to the ones present 99 weight_format_priority_order_present: Sequence[SupportedWeightsFormat] = [ 100 w for w in weight_format_priority_order if getattr(weights, w) is not None 101 ] 102 if not weight_format_priority_order_present: 103 raise ValueError( 104 f"None of the specified weight formats ({weight_format_priority_order}) is present ({weight_format_priority_order_present})" 105 ) 106 107 for wf in weight_format_priority_order_present: 108 if wf == "pytorch_state_dict": 109 assert weights.pytorch_state_dict is not None 110 try: 111 from .pytorch_backend import PytorchModelAdapter 112 113 return PytorchModelAdapter( 114 model_description=model_description, devices=devices 115 ) 116 except Exception as e: 117 errors.append(e) 118 elif wf == "tensorflow_saved_model_bundle": 119 assert weights.tensorflow_saved_model_bundle is not None 120 try: 121 from .tensorflow_backend import create_tf_model_adapter 122 123 return create_tf_model_adapter( 124 model_description=model_description, devices=devices 125 ) 126 except Exception as e: 127 errors.append(e) 128 elif wf == "onnx": 129 assert weights.onnx is not None 130 try: 131 from .onnx_backend import ONNXModelAdapter 132 133 return ONNXModelAdapter( 134 model_description=model_description, devices=devices 135 ) 136 except Exception as e: 137 errors.append(e) 138 elif wf == "torchscript": 139 assert weights.torchscript is not None 140 try: 141 from .torchscript_backend import TorchscriptModelAdapter 142 143 return TorchscriptModelAdapter( 144 model_description=model_description, devices=devices 145 ) 146 except Exception as e: 147 errors.append(e) 148 elif wf == "keras_hdf5": 149 assert weights.keras_hdf5 is not None 150 # keras can either be installed as a separate package or used as part of tensorflow 151 # we try to first import the keras model adapter using the separate package and, 152 # if it is not available, try to load the one using tf 153 try: 154 try: 155 from .keras_backend import KerasModelAdapter 156 except Exception: 157 from .tensorflow_backend import KerasModelAdapter 158 159 return KerasModelAdapter( 160 model_description=model_description, devices=devices 161 ) 162 except Exception as e: 163 errors.append(e) 164 else: 165 assert_never(wf) 166 167 assert errors 168 if len(weight_format_priority_order) == 1: 169 assert len(errors) == 1 170 raise errors[0] 171 172 else: 173 msg = ( 174 "None of the weight format specific model adapters could be created" 175 + " in this environment." 176 ) 177 if sys.version_info[:2] >= (3, 11): 178 raise ExceptionGroup(msg, errors) 179 else: 180 raise ValueError(msg) from Exception(errors)
Creates model adapter based on the passed spec Note: All specific adapters should happen inside this function to prevent different framework initializations interfering with each other
317def create_prediction_pipeline( 318 bioimageio_model: AnyModelDescr, 319 *, 320 devices: Optional[Sequence[str]] = None, 321 weight_format: Optional[SupportedWeightsFormat] = None, 322 weights_format: Optional[SupportedWeightsFormat] = None, 323 dataset_for_initial_statistics: Iterable[Union[Sample, Sequence[Tensor]]] = tuple(), 324 keep_updating_initial_dataset_statistics: bool = False, 325 fixed_dataset_statistics: Mapping[DatasetMeasure, MeasureValue] = MappingProxyType( 326 {} 327 ), 328 model_adapter: Optional[ModelAdapter] = None, 329 ns: Optional[BlocksizeParameter] = None, 330 default_blocksize_parameter: BlocksizeParameter = 10, 331 **deprecated_kwargs: Any, 332) -> PredictionPipeline: 333 """ 334 Creates prediction pipeline which includes: 335 * computation of input statistics 336 * preprocessing 337 * model prediction 338 * computation of output statistics 339 * postprocessing 340 341 Args: 342 bioimageio_model: A bioimageio model description. 343 devices: (optional) 344 weight_format: deprecated in favor of **weights_format** 345 weights_format: (optional) Use a specific **weights_format** rather than 346 choosing one automatically. 347 A corresponding `bioimageio.core.model_adapters.ModelAdapter` will be 348 created to run inference with the **bioimageio_model**. 349 dataset_for_initial_statistics: (optional) If preprocessing steps require input 350 dataset statistics, **dataset_for_initial_statistics** allows you to 351 specifcy a dataset from which these statistics are computed. 352 keep_updating_initial_dataset_statistics: (optional) Set to `True` if you want 353 to update dataset statistics with each processed sample. 354 fixed_dataset_statistics: (optional) Allows you to specify a mapping of 355 `DatasetMeasure`s to precomputed `MeasureValue`s. 356 model_adapter: (optional) Allows you to use a custom **model_adapter** instead 357 of creating one according to the present/selected **weights_format**. 358 ns: deprecated in favor of **default_blocksize_parameter** 359 default_blocksize_parameter: Allows to control the default block size for 360 blockwise predictions, see `BlocksizeParameter`. 361 362 """ 363 weights_format = weight_format or weights_format 364 del weight_format 365 default_blocksize_parameter = ns or default_blocksize_parameter 366 del ns 367 if deprecated_kwargs: 368 warnings.warn( 369 f"deprecated create_prediction_pipeline kwargs: {set(deprecated_kwargs)}" 370 ) 371 372 model_adapter = model_adapter or create_model_adapter( 373 model_description=bioimageio_model, 374 devices=devices, 375 weight_format_priority_order=weights_format and (weights_format,), 376 ) 377 378 input_ids = get_member_ids(bioimageio_model.inputs) 379 380 def dataset(): 381 common_stat: Stat = {} 382 for i, x in enumerate(dataset_for_initial_statistics): 383 if isinstance(x, Sample): 384 yield x 385 else: 386 yield Sample(members=dict(zip(input_ids, x)), stat=common_stat, id=i) 387 388 preprocessing, postprocessing = setup_pre_and_postprocessing( 389 bioimageio_model, 390 dataset(), 391 keep_updating_initial_dataset_stats=keep_updating_initial_dataset_statistics, 392 fixed_dataset_stats=fixed_dataset_statistics, 393 ) 394 395 return PredictionPipeline( 396 name=bioimageio_model.name, 397 model_description=bioimageio_model, 398 model_adapter=model_adapter, 399 preprocessing=preprocessing, 400 postprocessing=postprocessing, 401 default_blocksize_parameter=default_blocksize_parameter, 402 )
Creates prediction pipeline which includes:
- computation of input statistics
- preprocessing
- model prediction
- computation of output statistics
- postprocessing
Arguments:
- bioimageio_model: A bioimageio model description.
- devices: (optional)
- weight_format: deprecated in favor of weights_format
- weights_format: (optional) Use a specific weights_format rather than
choosing one automatically.
A corresponding
bioimageio.core.model_adapters.ModelAdapter
will be created to run inference with the bioimageio_model. - dataset_for_initial_statistics: (optional) If preprocessing steps require input dataset statistics, dataset_for_initial_statistics allows you to specifcy a dataset from which these statistics are computed.
- keep_updating_initial_dataset_statistics: (optional) Set to
True
if you want to update dataset statistics with each processed sample. - fixed_dataset_statistics: (optional) Allows you to specify a mapping of
DatasetMeasure
s to precomputedMeasureValue
s. - model_adapter: (optional) Allows you to use a custom model_adapter instead of creating one according to the present/selected weights_format.
- ns: deprecated in favor of default_blocksize_parameter
- default_blocksize_parameter: Allows to control the default block size for
blockwise predictions, see
BlocksizeParameter
.
66def dump_description( 67 rd: Union[ResourceDescr, InvalidDescr], 68 /, 69 *, 70 exclude_unset: bool = True, 71 exclude_defaults: bool = False, 72) -> BioimageioYamlContent: 73 """Converts a resource to a dictionary containing only simple types that can directly be serialzed to YAML. 74 75 Args: 76 rd: bioimageio resource description 77 exclude_unset: Exclude fields that have not explicitly be set. 78 exclude_defaults: Exclude fields that have the default value (even if set explicitly). 79 """ 80 return rd.model_dump( 81 mode="json", exclude_unset=exclude_unset, exclude_defaults=exclude_defaults 82 )
Converts a resource to a dictionary containing only simple types that can directly be serialzed to YAML.
Arguments:
- rd: bioimageio resource description
- exclude_unset: Exclude fields that have not explicitly be set.
- exclude_defaults: Exclude fields that have the default value (even if set explicitly).
77def enable_determinism( 78 mode: Literal["seed_only", "full"] = "full", 79 weight_formats: Optional[Sequence[SupportedWeightsFormat]] = None, 80): 81 """Seed and configure ML frameworks for maximum reproducibility. 82 May degrade performance. Only recommended for testing reproducibility! 83 84 Seed any random generators and (if **mode**=="full") request ML frameworks to use 85 deterministic algorithms. 86 87 Args: 88 mode: determinism mode 89 - 'seed_only' -- only set seeds, or 90 - 'full' determinsm features (might degrade performance or throw exceptions) 91 weight_formats: Limit deep learning importing deep learning frameworks 92 based on weight_formats. 93 E.g. this allows to avoid importing tensorflow when testing with pytorch. 94 95 Notes: 96 - **mode** == "full" might degrade performance or throw exceptions. 97 - Subsequent inference calls might still differ. Call before each function 98 (sequence) that is expected to be reproducible. 99 - Degraded performance: Use for testing reproducibility only! 100 - Recipes: 101 - [PyTorch](https://pytorch.org/docs/stable/notes/randomness.html) 102 - [Keras](https://keras.io/examples/keras_recipes/reproducibility_recipes/) 103 - [NumPy](https://numpy.org/doc/2.0/reference/random/generated/numpy.random.seed.html) 104 """ 105 try: 106 try: 107 import numpy.random 108 except ImportError: 109 pass 110 else: 111 numpy.random.seed(0) 112 except Exception as e: 113 logger.debug(str(e)) 114 115 if ( 116 weight_formats is None 117 or "pytorch_state_dict" in weight_formats 118 or "torchscript" in weight_formats 119 ): 120 try: 121 try: 122 import torch 123 except ImportError: 124 pass 125 else: 126 _ = torch.manual_seed(0) 127 torch.use_deterministic_algorithms(mode == "full") 128 except Exception as e: 129 logger.debug(str(e)) 130 131 if ( 132 weight_formats is None 133 or "tensorflow_saved_model_bundle" in weight_formats 134 or "keras_hdf5" in weight_formats 135 ): 136 try: 137 os.environ["TF_ENABLE_ONEDNN_OPTS"] = "0" 138 try: 139 import tensorflow as tf # pyright: ignore[reportMissingTypeStubs] 140 except ImportError: 141 pass 142 else: 143 tf.random.set_seed(0) 144 if mode == "full": 145 tf.config.experimental.enable_op_determinism() 146 # TODO: find possibility to switch it off again?? 147 except Exception as e: 148 logger.debug(str(e)) 149 150 if weight_formats is None or "keras_hdf5" in weight_formats: 151 try: 152 try: 153 import keras # pyright: ignore[reportMissingTypeStubs] 154 except ImportError: 155 pass 156 else: 157 keras.utils.set_random_seed(0) 158 except Exception as e: 159 logger.debug(str(e))
Seed and configure ML frameworks for maximum reproducibility. May degrade performance. Only recommended for testing reproducibility!
Seed any random generators and (if mode=="full") request ML frameworks to use deterministic algorithms.
Arguments:
- mode: determinism mode
- 'seed_only' -- only set seeds, or
- 'full' determinsm features (might degrade performance or throw exceptions)
- weight_formats: Limit deep learning importing deep learning frameworks based on weight_formats. E.g. this allows to avoid importing tensorflow when testing with pytorch.
Notes:
180def load_dataset_description( 181 source: Union[PermissiveFileSource, ZipFile], 182 /, 183 *, 184 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 185 perform_io_checks: Optional[bool] = None, 186 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 187 sha256: Optional[Sha256] = None, 188) -> AnyDatasetDescr: 189 """same as `load_description`, but addtionally ensures that the loaded 190 description is valid and of type 'dataset'. 191 """ 192 rd = load_description( 193 source, 194 format_version=format_version, 195 perform_io_checks=perform_io_checks, 196 known_files=known_files, 197 sha256=sha256, 198 ) 199 return ensure_description_is_dataset(rd)
same as load_description
, but addtionally ensures that the loaded
description is valid and of type 'dataset'.
485def load_description_and_test( 486 source: Union[ResourceDescr, PermissiveFileSource, BioimageioYamlContent], 487 *, 488 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 489 weight_format: Optional[SupportedWeightsFormat] = None, 490 devices: Optional[Sequence[str]] = None, 491 determinism: Literal["seed_only", "full"] = "seed_only", 492 expected_type: Optional[str] = None, 493 sha256: Optional[Sha256] = None, 494 stop_early: bool = False, 495 **deprecated: Unpack[DeprecatedKwargs], 496) -> Union[ResourceDescr, InvalidDescr]: 497 """Test a bioimage.io resource dynamically, 498 for example run prediction of test tensors for models. 499 500 See `test_description` for more details. 501 502 Returns: 503 A (possibly invalid) resource description object 504 with a populated `.validation_summary` attribute. 505 """ 506 if isinstance(source, ResourceDescrBase): 507 root = source.root 508 file_name = source.file_name 509 if ( 510 ( 511 format_version 512 not in ( 513 DISCOVER, 514 source.format_version, 515 ".".join(source.format_version.split(".")[:2]), 516 ) 517 ) 518 or (c := source.validation_summary.details[0].context) is None 519 or not c.perform_io_checks 520 ): 521 logger.debug( 522 "deserializing source to ensure we validate and test using format {} and perform io checks", 523 format_version, 524 ) 525 source = dump_description(source) 526 else: 527 root = Path() 528 file_name = None 529 530 if isinstance(source, ResourceDescrBase): 531 rd = source 532 elif isinstance(source, dict): 533 # check context for a given root; default to root of source 534 context = get_validation_context( 535 ValidationContext(root=root, file_name=file_name) 536 ).replace( 537 perform_io_checks=True # make sure we perform io checks though 538 ) 539 540 rd = build_description( 541 source, 542 format_version=format_version, 543 context=context, 544 ) 545 else: 546 rd = load_description( 547 source, format_version=format_version, sha256=sha256, perform_io_checks=True 548 ) 549 550 rd.validation_summary.env.add( 551 InstalledPackage(name="bioimageio.core", version=VERSION) 552 ) 553 554 if expected_type is not None: 555 _test_expected_resource_type(rd, expected_type) 556 557 if isinstance(rd, (v0_4.ModelDescr, v0_5.ModelDescr)): 558 if weight_format is None: 559 weight_formats: List[SupportedWeightsFormat] = [ 560 w for w, we in rd.weights if we is not None 561 ] # pyright: ignore[reportAssignmentType] 562 else: 563 weight_formats = [weight_format] 564 565 enable_determinism(determinism, weight_formats=weight_formats) 566 for w in weight_formats: 567 _test_model_inference(rd, w, devices, stop_early=stop_early, **deprecated) 568 if stop_early and rd.validation_summary.status == "failed": 569 break 570 571 if not isinstance(rd, v0_4.ModelDescr): 572 _test_model_inference_parametrized( 573 rd, w, devices, stop_early=stop_early 574 ) 575 if stop_early and rd.validation_summary.status == "failed": 576 break 577 578 # TODO: add execution of jupyter notebooks 579 # TODO: add more tests 580 581 if rd.validation_summary.status == "valid-format": 582 rd.validation_summary.status = "passed" 583 584 return rd
Test a bioimage.io resource dynamically, for example run prediction of test tensors for models.
See test_description
for more details.
Returns:
A (possibly invalid) resource description object with a populated
.validation_summary
attribute.
232def load_description_and_validate_format_only( 233 source: Union[PermissiveFileSource, ZipFile], 234 /, 235 *, 236 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 237 perform_io_checks: Optional[bool] = None, 238 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 239 sha256: Optional[Sha256] = None, 240) -> ValidationSummary: 241 """same as `load_description`, but only return the validation summary. 242 243 Returns: 244 Validation summary of the bioimage.io resource found at `source`. 245 246 """ 247 rd = load_description( 248 source, 249 format_version=format_version, 250 perform_io_checks=perform_io_checks, 251 known_files=known_files, 252 sha256=sha256, 253 ) 254 assert rd.validation_summary is not None 255 return rd.validation_summary
same as load_description
, but only return the validation summary.
Returns:
Validation summary of the bioimage.io resource found at
source
.
57def load_description( 58 source: Union[PermissiveFileSource, ZipFile], 59 /, 60 *, 61 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 62 perform_io_checks: Optional[bool] = None, 63 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 64 sha256: Optional[Sha256] = None, 65) -> Union[ResourceDescr, InvalidDescr]: 66 """load a bioimage.io resource description 67 68 Args: 69 source: Path or URL to an rdf.yaml or a bioimage.io package 70 (zip-file with rdf.yaml in it). 71 format_version: (optional) Use this argument to load the resource and 72 convert its metadata to a higher format_version. 73 perform_io_checks: Wether or not to perform validation that requires file io, 74 e.g. downloading a remote files. The existence of local 75 absolute file paths is still being checked. 76 known_files: Allows to bypass download and hashing of referenced files 77 (even if perform_io_checks is True). 78 Checked files will be added to this dictionary 79 with their SHA-256 value. 80 sha256: Optional SHA-256 value of **source** 81 82 Returns: 83 An object holding all metadata of the bioimage.io resource 84 85 """ 86 if isinstance(source, ResourceDescrBase): 87 name = getattr(source, "name", f"{str(source)[:10]}...") 88 logger.warning("returning already loaded description '{}' as is", name) 89 return source # pyright: ignore[reportReturnType] 90 91 opened = open_bioimageio_yaml(source, sha256=sha256) 92 93 context = get_validation_context().replace( 94 root=opened.original_root, 95 file_name=opened.original_file_name, 96 perform_io_checks=perform_io_checks, 97 known_files=known_files, 98 ) 99 100 return build_description( 101 opened.content, 102 context=context, 103 format_version=format_version, 104 )
load a bioimage.io resource description
Arguments:
- source: Path or URL to an rdf.yaml or a bioimage.io package (zip-file with rdf.yaml in it).
- format_version: (optional) Use this argument to load the resource and convert its metadata to a higher format_version.
- perform_io_checks: Wether or not to perform validation that requires file io, e.g. downloading a remote files. The existence of local absolute file paths is still being checked.
- known_files: Allows to bypass download and hashing of referenced files (even if perform_io_checks is True). Checked files will be added to this dictionary with their SHA-256 value.
- sha256: Optional SHA-256 value of source
Returns:
An object holding all metadata of the bioimage.io resource
131def load_model_description( 132 source: Union[PermissiveFileSource, ZipFile], 133 /, 134 *, 135 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 136 perform_io_checks: Optional[bool] = None, 137 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 138 sha256: Optional[Sha256] = None, 139) -> AnyModelDescr: 140 """same as `load_description`, but addtionally ensures that the loaded 141 description is valid and of type 'model'. 142 143 Raises: 144 ValueError: for invalid or non-model resources 145 """ 146 rd = load_description( 147 source, 148 format_version=format_version, 149 perform_io_checks=perform_io_checks, 150 known_files=known_files, 151 sha256=sha256, 152 ) 153 return ensure_description_is_model(rd)
same as load_description
, but addtionally ensures that the loaded
description is valid and of type 'model'.
Raises:
- ValueError: for invalid or non-model resources
131def load_model_description( 132 source: Union[PermissiveFileSource, ZipFile], 133 /, 134 *, 135 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 136 perform_io_checks: Optional[bool] = None, 137 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 138 sha256: Optional[Sha256] = None, 139) -> AnyModelDescr: 140 """same as `load_description`, but addtionally ensures that the loaded 141 description is valid and of type 'model'. 142 143 Raises: 144 ValueError: for invalid or non-model resources 145 """ 146 rd = load_description( 147 source, 148 format_version=format_version, 149 perform_io_checks=perform_io_checks, 150 known_files=known_files, 151 sha256=sha256, 152 ) 153 return ensure_description_is_model(rd)
alias of load_model_description
57def load_description( 58 source: Union[PermissiveFileSource, ZipFile], 59 /, 60 *, 61 format_version: Union[FormatVersionPlaceholder, str] = DISCOVER, 62 perform_io_checks: Optional[bool] = None, 63 known_files: Optional[Dict[str, Optional[Sha256]]] = None, 64 sha256: Optional[Sha256] = None, 65) -> Union[ResourceDescr, InvalidDescr]: 66 """load a bioimage.io resource description 67 68 Args: 69 source: Path or URL to an rdf.yaml or a bioimage.io package 70 (zip-file with rdf.yaml in it). 71 format_version: (optional) Use this argument to load the resource and 72 convert its metadata to a higher format_version. 73 perform_io_checks: Wether or not to perform validation that requires file io, 74 e.g. downloading a remote files. The existence of local 75 absolute file paths is still being checked. 76 known_files: Allows to bypass download and hashing of referenced files 77 (even if perform_io_checks is True). 78 Checked files will be added to this dictionary 79 with their SHA-256 value. 80 sha256: Optional SHA-256 value of **source** 81 82 Returns: 83 An object holding all metadata of the bioimage.io resource 84 85 """ 86 if isinstance(source, ResourceDescrBase): 87 name = getattr(source, "name", f"{str(source)[:10]}...") 88 logger.warning("returning already loaded description '{}' as is", name) 89 return source # pyright: ignore[reportReturnType] 90 91 opened = open_bioimageio_yaml(source, sha256=sha256) 92 93 context = get_validation_context().replace( 94 root=opened.original_root, 95 file_name=opened.original_file_name, 96 perform_io_checks=perform_io_checks, 97 known_files=known_files, 98 ) 99 100 return build_description( 101 opened.content, 102 context=context, 103 format_version=format_version, 104 )
alias of load_description
131def predict_many( 132 *, 133 model: Union[ 134 PermissiveFileSource, v0_4.ModelDescr, v0_5.ModelDescr, PredictionPipeline 135 ], 136 inputs: Union[Iterable[PerMember[TensorSource]], Iterable[TensorSource]], 137 sample_id: str = "sample{i:03}", 138 blocksize_parameter: Optional[ 139 Union[ 140 v0_5.ParameterizedSize_N, 141 Mapping[Tuple[MemberId, AxisId], v0_5.ParameterizedSize_N], 142 ] 143 ] = None, 144 skip_preprocessing: bool = False, 145 skip_postprocessing: bool = False, 146 save_output_path: Optional[Union[Path, str]] = None, 147) -> Iterator[Sample]: 148 """Run prediction for a multiple sets of inputs with a bioimage.io model 149 150 Args: 151 model: Model to predict with. 152 May be given as RDF source, model description or prediction pipeline. 153 inputs: An iterable of the named input(s) for this model as a dictionary. 154 sample_id: The sample id. 155 note: `{i}` will be formatted as the i-th sample. 156 If `{i}` (or `{i:`) is not present and `inputs` is not an iterable `{i:03}` 157 is appended. 158 blocksize_parameter: (optional) Tile the input into blocks parametrized by 159 blocksize according to any parametrized axis sizes defined in the model RDF. 160 skip_preprocessing: Flag to skip the model's preprocessing. 161 skip_postprocessing: Flag to skip the model's postprocessing. 162 save_output_path: A path to save the output to. 163 Must contain: 164 - `{sample_id}` to differentiate predicted samples 165 - `{output_id}` (or `{member_id}`) if the model has multiple outputs 166 """ 167 if save_output_path is not None and "{sample_id}" not in str(save_output_path): 168 raise ValueError( 169 f"Missing `{{sample_id}}` in save_output_path={save_output_path}" 170 + " to differentiate predicted samples." 171 ) 172 173 if isinstance(model, PredictionPipeline): 174 pp = model 175 else: 176 if not isinstance(model, (v0_4.ModelDescr, v0_5.ModelDescr)): 177 loaded = load_description(model) 178 if not isinstance(loaded, (v0_4.ModelDescr, v0_5.ModelDescr)): 179 raise ValueError(f"expected model description, but got {loaded}") 180 model = loaded 181 182 pp = create_prediction_pipeline(model) 183 184 if not isinstance(inputs, collections.abc.Mapping): 185 if "{i}" not in sample_id and "{i:" not in sample_id: 186 sample_id += "{i:03}" 187 188 total = len(inputs) if isinstance(inputs, collections.abc.Sized) else None 189 190 for i, ipts in tqdm(enumerate(inputs), total=total): 191 yield predict( 192 model=pp, 193 inputs=ipts, 194 sample_id=sample_id.format(i=i), 195 blocksize_parameter=blocksize_parameter, 196 skip_preprocessing=skip_preprocessing, 197 skip_postprocessing=skip_postprocessing, 198 save_output_path=save_output_path, 199 )
Run prediction for a multiple sets of inputs with a bioimage.io model
Arguments:
- model: Model to predict with. May be given as RDF source, model description or prediction pipeline.
- inputs: An iterable of the named input(s) for this model as a dictionary.
- sample_id: The sample id.
note:
{i}
will be formatted as the i-th sample. If{i}
(or{i:
) is not present andinputs
is not an iterable{i:03}
is appended. - blocksize_parameter: (optional) Tile the input into blocks parametrized by blocksize according to any parametrized axis sizes defined in the model RDF.
- skip_preprocessing: Flag to skip the model's preprocessing.
- skip_postprocessing: Flag to skip the model's postprocessing.
- save_output_path: A path to save the output to.
Must contain:
{sample_id}
to differentiate predicted samples{output_id}
(or{member_id}
) if the model has multiple outputs
29def predict( 30 *, 31 model: Union[ 32 PermissiveFileSource, v0_4.ModelDescr, v0_5.ModelDescr, PredictionPipeline 33 ], 34 inputs: Union[Sample, PerMember[TensorSource], TensorSource], 35 sample_id: Hashable = "sample", 36 blocksize_parameter: Optional[BlocksizeParameter] = None, 37 input_block_shape: Optional[Mapping[MemberId, Mapping[AxisId, int]]] = None, 38 skip_preprocessing: bool = False, 39 skip_postprocessing: bool = False, 40 save_output_path: Optional[Union[Path, str]] = None, 41) -> Sample: 42 """Run prediction for a single set of input(s) with a bioimage.io model 43 44 Args: 45 model: Model to predict with. 46 May be given as RDF source, model description or prediction pipeline. 47 inputs: the input sample or the named input(s) for this model as a dictionary 48 sample_id: the sample id. 49 The **sample_id** is used to format **save_output_path** 50 and to distinguish sample specific log messages. 51 blocksize_parameter: (optional) Tile the input into blocks parametrized by 52 **blocksize_parameter** according to any parametrized axis sizes defined 53 by the **model**. 54 See `bioimageio.spec.model.v0_5.ParameterizedSize` for details. 55 Note: For a predetermined, fixed block shape use **input_block_shape**. 56 input_block_shape: (optional) Tile the input sample tensors into blocks. 57 Note: Use **blocksize_parameter** for a parameterized block shape to 58 run prediction independent of the exact block shape. 59 skip_preprocessing: Flag to skip the model's preprocessing. 60 skip_postprocessing: Flag to skip the model's postprocessing. 61 save_output_path: A path with to save the output to. M 62 Must contain: 63 - `{output_id}` (or `{member_id}`) if the model has multiple output tensors 64 May contain: 65 - `{sample_id}` to avoid overwriting recurrent calls 66 """ 67 if isinstance(model, PredictionPipeline): 68 pp = model 69 model = pp.model_description 70 else: 71 if not isinstance(model, (v0_4.ModelDescr, v0_5.ModelDescr)): 72 loaded = load_description(model) 73 if not isinstance(loaded, (v0_4.ModelDescr, v0_5.ModelDescr)): 74 raise ValueError(f"expected model description, but got {loaded}") 75 model = loaded 76 77 pp = create_prediction_pipeline(model) 78 79 if save_output_path is not None: 80 if ( 81 "{output_id}" not in str(save_output_path) 82 and "{member_id}" not in str(save_output_path) 83 and len(model.outputs) > 1 84 ): 85 raise ValueError( 86 f"Missing `{{output_id}}` in save_output_path={save_output_path} to " 87 + "distinguish model outputs " 88 + str([get_member_id(d) for d in model.outputs]) 89 ) 90 91 if isinstance(inputs, Sample): 92 sample = inputs 93 else: 94 sample = create_sample_for_model( 95 pp.model_description, inputs=inputs, sample_id=sample_id 96 ) 97 98 if input_block_shape is not None: 99 if blocksize_parameter is not None: 100 logger.warning( 101 "ignoring blocksize_parameter={} in favor of input_block_shape={}", 102 blocksize_parameter, 103 input_block_shape, 104 ) 105 106 output = pp.predict_sample_with_fixed_blocking( 107 sample, 108 input_block_shape=input_block_shape, 109 skip_preprocessing=skip_preprocessing, 110 skip_postprocessing=skip_postprocessing, 111 ) 112 elif blocksize_parameter is not None: 113 output = pp.predict_sample_with_blocking( 114 sample, 115 skip_preprocessing=skip_preprocessing, 116 skip_postprocessing=skip_postprocessing, 117 ns=blocksize_parameter, 118 ) 119 else: 120 output = pp.predict_sample_without_blocking( 121 sample, 122 skip_preprocessing=skip_preprocessing, 123 skip_postprocessing=skip_postprocessing, 124 ) 125 if save_output_path: 126 save_sample(save_output_path, output) 127 128 return output
Run prediction for a single set of input(s) with a bioimage.io model
Arguments:
- model: Model to predict with. May be given as RDF source, model description or prediction pipeline.
- inputs: the input sample or the named input(s) for this model as a dictionary
- sample_id: the sample id. The sample_id is used to format save_output_path and to distinguish sample specific log messages.
- blocksize_parameter: (optional) Tile the input into blocks parametrized by
blocksize_parameter according to any parametrized axis sizes defined
by the model.
See
bioimageio.spec.model.v0_5.ParameterizedSize
for details. Note: For a predetermined, fixed block shape use input_block_shape. - input_block_shape: (optional) Tile the input sample tensors into blocks. Note: Use blocksize_parameter for a parameterized block shape to run prediction independent of the exact block shape.
- skip_preprocessing: Flag to skip the model's preprocessing.
- skip_postprocessing: Flag to skip the model's postprocessing.
- save_output_path: A path with to save the output to. M
Must contain:
{output_id}
(or{member_id}
) if the model has multiple output tensors May contain:{sample_id}
to avoid overwriting recurrent calls
51class PredictionPipeline: 52 """ 53 Represents model computation including preprocessing and postprocessing 54 Note: Ideally use the `PredictionPipeline` in a with statement 55 (as a context manager). 56 """ 57 58 def __init__( 59 self, 60 *, 61 name: str, 62 model_description: AnyModelDescr, 63 preprocessing: List[Processing], 64 postprocessing: List[Processing], 65 model_adapter: ModelAdapter, 66 default_ns: Optional[BlocksizeParameter] = None, 67 default_blocksize_parameter: BlocksizeParameter = 10, 68 default_batch_size: int = 1, 69 ) -> None: 70 """Use `create_prediction_pipeline` to create a `PredictionPipeline`""" 71 super().__init__() 72 default_blocksize_parameter = default_ns or default_blocksize_parameter 73 if default_ns is not None: 74 warnings.warn( 75 "Argument `default_ns` is deprecated in favor of" 76 + " `default_blocksize_paramter` and will be removed soon." 77 ) 78 del default_ns 79 80 if model_description.run_mode: 81 warnings.warn( 82 f"Not yet implemented inference for run mode '{model_description.run_mode.name}'" 83 ) 84 85 self.name = name 86 self._preprocessing = preprocessing 87 self._postprocessing = postprocessing 88 89 self.model_description = model_description 90 if isinstance(model_description, v0_4.ModelDescr): 91 self._default_input_halo: PerMember[PerAxis[Halo]] = {} 92 self._block_transform = None 93 else: 94 default_output_halo = { 95 t.id: { 96 a.id: Halo(a.halo, a.halo) 97 for a in t.axes 98 if isinstance(a, v0_5.WithHalo) 99 } 100 for t in model_description.outputs 101 } 102 self._default_input_halo = get_input_halo( 103 model_description, default_output_halo 104 ) 105 self._block_transform = get_block_transform(model_description) 106 107 self._default_blocksize_parameter = default_blocksize_parameter 108 self._default_batch_size = default_batch_size 109 110 self._input_ids = get_member_ids(model_description.inputs) 111 self._output_ids = get_member_ids(model_description.outputs) 112 113 self._adapter: ModelAdapter = model_adapter 114 115 def __enter__(self): 116 self.load() 117 return self 118 119 def __exit__(self, exc_type, exc_val, exc_tb): # type: ignore 120 self.unload() 121 return False 122 123 def predict_sample_block( 124 self, 125 sample_block: SampleBlockWithOrigin, 126 skip_preprocessing: bool = False, 127 skip_postprocessing: bool = False, 128 ) -> SampleBlock: 129 if isinstance(self.model_description, v0_4.ModelDescr): 130 raise NotImplementedError( 131 f"predict_sample_block not implemented for model {self.model_description.format_version}" 132 ) 133 else: 134 assert self._block_transform is not None 135 136 if not skip_preprocessing: 137 self.apply_preprocessing(sample_block) 138 139 output_meta = sample_block.get_transformed_meta(self._block_transform) 140 local_output = self._adapter.forward(sample_block) 141 142 output = output_meta.with_data(local_output.members, stat=local_output.stat) 143 if not skip_postprocessing: 144 self.apply_postprocessing(output) 145 146 return output 147 148 def predict_sample_without_blocking( 149 self, 150 sample: Sample, 151 skip_preprocessing: bool = False, 152 skip_postprocessing: bool = False, 153 ) -> Sample: 154 """predict a sample. 155 The sample's tensor shapes have to match the model's input tensor description. 156 If that is not the case, consider `predict_sample_with_blocking`""" 157 158 if not skip_preprocessing: 159 self.apply_preprocessing(sample) 160 161 output = self._adapter.forward(sample) 162 if not skip_postprocessing: 163 self.apply_postprocessing(output) 164 165 return output 166 167 def get_output_sample_id(self, input_sample_id: SampleId): 168 warnings.warn( 169 "`PredictionPipeline.get_output_sample_id()` is deprecated and will be" 170 + " removed soon. Output sample id is equal to input sample id, hence this" 171 + " function is not needed." 172 ) 173 return input_sample_id 174 175 def predict_sample_with_fixed_blocking( 176 self, 177 sample: Sample, 178 input_block_shape: Mapping[MemberId, Mapping[AxisId, int]], 179 *, 180 skip_preprocessing: bool = False, 181 skip_postprocessing: bool = False, 182 ) -> Sample: 183 if not skip_preprocessing: 184 self.apply_preprocessing(sample) 185 186 n_blocks, input_blocks = sample.split_into_blocks( 187 input_block_shape, 188 halo=self._default_input_halo, 189 pad_mode="reflect", 190 ) 191 input_blocks = list(input_blocks) 192 predicted_blocks: List[SampleBlock] = [] 193 logger.info( 194 "split sample shape {} into {} blocks of {}.", 195 {k: dict(v) for k, v in sample.shape.items()}, 196 n_blocks, 197 {k: dict(v) for k, v in input_block_shape.items()}, 198 ) 199 for b in tqdm( 200 input_blocks, 201 desc=f"predict {sample.id or ''} with {self.model_description.id or self.model_description.name}", 202 unit="block", 203 unit_divisor=1, 204 total=n_blocks, 205 ): 206 predicted_blocks.append( 207 self.predict_sample_block( 208 b, skip_preprocessing=True, skip_postprocessing=True 209 ) 210 ) 211 212 predicted_sample = Sample.from_blocks(predicted_blocks) 213 if not skip_postprocessing: 214 self.apply_postprocessing(predicted_sample) 215 216 return predicted_sample 217 218 def predict_sample_with_blocking( 219 self, 220 sample: Sample, 221 skip_preprocessing: bool = False, 222 skip_postprocessing: bool = False, 223 ns: Optional[ 224 Union[ 225 v0_5.ParameterizedSize_N, 226 Mapping[Tuple[MemberId, AxisId], v0_5.ParameterizedSize_N], 227 ] 228 ] = None, 229 batch_size: Optional[int] = None, 230 ) -> Sample: 231 """predict a sample by splitting it into blocks according to the model and the `ns` parameter""" 232 233 if isinstance(self.model_description, v0_4.ModelDescr): 234 raise NotImplementedError( 235 "`predict_sample_with_blocking` not implemented for v0_4.ModelDescr" 236 + f" {self.model_description.name}." 237 + " Consider using `predict_sample_with_fixed_blocking`" 238 ) 239 240 ns = ns or self._default_blocksize_parameter 241 if isinstance(ns, int): 242 ns = { 243 (ipt.id, a.id): ns 244 for ipt in self.model_description.inputs 245 for a in ipt.axes 246 if isinstance(a.size, v0_5.ParameterizedSize) 247 } 248 input_block_shape = self.model_description.get_tensor_sizes( 249 ns, batch_size or self._default_batch_size 250 ).inputs 251 252 return self.predict_sample_with_fixed_blocking( 253 sample, 254 input_block_shape=input_block_shape, 255 skip_preprocessing=skip_preprocessing, 256 skip_postprocessing=skip_postprocessing, 257 ) 258 259 # def predict( 260 # self, 261 # inputs: Predict_IO, 262 # skip_preprocessing: bool = False, 263 # skip_postprocessing: bool = False, 264 # ) -> Predict_IO: 265 # """Run model prediction **including** pre/postprocessing.""" 266 267 # if isinstance(inputs, Sample): 268 # return self.predict_sample_with_blocking( 269 # inputs, 270 # skip_preprocessing=skip_preprocessing, 271 # skip_postprocessing=skip_postprocessing, 272 # ) 273 # elif isinstance(inputs, collections.abc.Iterable): 274 # return ( 275 # self.predict( 276 # ipt, 277 # skip_preprocessing=skip_preprocessing, 278 # skip_postprocessing=skip_postprocessing, 279 # ) 280 # for ipt in inputs 281 # ) 282 # else: 283 # assert_never(inputs) 284 285 def apply_preprocessing(self, sample: Union[Sample, SampleBlockWithOrigin]) -> None: 286 """apply preprocessing in-place, also updates sample stats""" 287 for op in self._preprocessing: 288 op(sample) 289 290 def apply_postprocessing( 291 self, sample: Union[Sample, SampleBlock, SampleBlockWithOrigin] 292 ) -> None: 293 """apply postprocessing in-place, also updates samples stats""" 294 for op in self._postprocessing: 295 if isinstance(sample, (Sample, SampleBlockWithOrigin)): 296 op(sample) 297 elif not isinstance(op, BlockedOperator): 298 raise NotImplementedError( 299 "block wise update of output statistics not yet implemented" 300 ) 301 else: 302 op(sample) 303 304 def load(self): 305 """ 306 optional step: load model onto devices before calling forward if not using it as context manager 307 """ 308 pass 309 310 def unload(self): 311 """ 312 free any device memory in use 313 """ 314 self._adapter.unload()
Represents model computation including preprocessing and postprocessing
Note: Ideally use the PredictionPipeline
in a with statement
(as a context manager).
58 def __init__( 59 self, 60 *, 61 name: str, 62 model_description: AnyModelDescr, 63 preprocessing: List[Processing], 64 postprocessing: List[Processing], 65 model_adapter: ModelAdapter, 66 default_ns: Optional[BlocksizeParameter] = None, 67 default_blocksize_parameter: BlocksizeParameter = 10, 68 default_batch_size: int = 1, 69 ) -> None: 70 """Use `create_prediction_pipeline` to create a `PredictionPipeline`""" 71 super().__init__() 72 default_blocksize_parameter = default_ns or default_blocksize_parameter 73 if default_ns is not None: 74 warnings.warn( 75 "Argument `default_ns` is deprecated in favor of" 76 + " `default_blocksize_paramter` and will be removed soon." 77 ) 78 del default_ns 79 80 if model_description.run_mode: 81 warnings.warn( 82 f"Not yet implemented inference for run mode '{model_description.run_mode.name}'" 83 ) 84 85 self.name = name 86 self._preprocessing = preprocessing 87 self._postprocessing = postprocessing 88 89 self.model_description = model_description 90 if isinstance(model_description, v0_4.ModelDescr): 91 self._default_input_halo: PerMember[PerAxis[Halo]] = {} 92 self._block_transform = None 93 else: 94 default_output_halo = { 95 t.id: { 96 a.id: Halo(a.halo, a.halo) 97 for a in t.axes 98 if isinstance(a, v0_5.WithHalo) 99 } 100 for t in model_description.outputs 101 } 102 self._default_input_halo = get_input_halo( 103 model_description, default_output_halo 104 ) 105 self._block_transform = get_block_transform(model_description) 106 107 self._default_blocksize_parameter = default_blocksize_parameter 108 self._default_batch_size = default_batch_size 109 110 self._input_ids = get_member_ids(model_description.inputs) 111 self._output_ids = get_member_ids(model_description.outputs) 112 113 self._adapter: ModelAdapter = model_adapter
Use create_prediction_pipeline
to create a PredictionPipeline
123 def predict_sample_block( 124 self, 125 sample_block: SampleBlockWithOrigin, 126 skip_preprocessing: bool = False, 127 skip_postprocessing: bool = False, 128 ) -> SampleBlock: 129 if isinstance(self.model_description, v0_4.ModelDescr): 130 raise NotImplementedError( 131 f"predict_sample_block not implemented for model {self.model_description.format_version}" 132 ) 133 else: 134 assert self._block_transform is not None 135 136 if not skip_preprocessing: 137 self.apply_preprocessing(sample_block) 138 139 output_meta = sample_block.get_transformed_meta(self._block_transform) 140 local_output = self._adapter.forward(sample_block) 141 142 output = output_meta.with_data(local_output.members, stat=local_output.stat) 143 if not skip_postprocessing: 144 self.apply_postprocessing(output) 145 146 return output
148 def predict_sample_without_blocking( 149 self, 150 sample: Sample, 151 skip_preprocessing: bool = False, 152 skip_postprocessing: bool = False, 153 ) -> Sample: 154 """predict a sample. 155 The sample's tensor shapes have to match the model's input tensor description. 156 If that is not the case, consider `predict_sample_with_blocking`""" 157 158 if not skip_preprocessing: 159 self.apply_preprocessing(sample) 160 161 output = self._adapter.forward(sample) 162 if not skip_postprocessing: 163 self.apply_postprocessing(output) 164 165 return output
predict a sample.
The sample's tensor shapes have to match the model's input tensor description.
If that is not the case, consider predict_sample_with_blocking
167 def get_output_sample_id(self, input_sample_id: SampleId): 168 warnings.warn( 169 "`PredictionPipeline.get_output_sample_id()` is deprecated and will be" 170 + " removed soon. Output sample id is equal to input sample id, hence this" 171 + " function is not needed." 172 ) 173 return input_sample_id
175 def predict_sample_with_fixed_blocking( 176 self, 177 sample: Sample, 178 input_block_shape: Mapping[MemberId, Mapping[AxisId, int]], 179 *, 180 skip_preprocessing: bool = False, 181 skip_postprocessing: bool = False, 182 ) -> Sample: 183 if not skip_preprocessing: 184 self.apply_preprocessing(sample) 185 186 n_blocks, input_blocks = sample.split_into_blocks( 187 input_block_shape, 188 halo=self._default_input_halo, 189 pad_mode="reflect", 190 ) 191 input_blocks = list(input_blocks) 192 predicted_blocks: List[SampleBlock] = [] 193 logger.info( 194 "split sample shape {} into {} blocks of {}.", 195 {k: dict(v) for k, v in sample.shape.items()}, 196 n_blocks, 197 {k: dict(v) for k, v in input_block_shape.items()}, 198 ) 199 for b in tqdm( 200 input_blocks, 201 desc=f"predict {sample.id or ''} with {self.model_description.id or self.model_description.name}", 202 unit="block", 203 unit_divisor=1, 204 total=n_blocks, 205 ): 206 predicted_blocks.append( 207 self.predict_sample_block( 208 b, skip_preprocessing=True, skip_postprocessing=True 209 ) 210 ) 211 212 predicted_sample = Sample.from_blocks(predicted_blocks) 213 if not skip_postprocessing: 214 self.apply_postprocessing(predicted_sample) 215 216 return predicted_sample
218 def predict_sample_with_blocking( 219 self, 220 sample: Sample, 221 skip_preprocessing: bool = False, 222 skip_postprocessing: bool = False, 223 ns: Optional[ 224 Union[ 225 v0_5.ParameterizedSize_N, 226 Mapping[Tuple[MemberId, AxisId], v0_5.ParameterizedSize_N], 227 ] 228 ] = None, 229 batch_size: Optional[int] = None, 230 ) -> Sample: 231 """predict a sample by splitting it into blocks according to the model and the `ns` parameter""" 232 233 if isinstance(self.model_description, v0_4.ModelDescr): 234 raise NotImplementedError( 235 "`predict_sample_with_blocking` not implemented for v0_4.ModelDescr" 236 + f" {self.model_description.name}." 237 + " Consider using `predict_sample_with_fixed_blocking`" 238 ) 239 240 ns = ns or self._default_blocksize_parameter 241 if isinstance(ns, int): 242 ns = { 243 (ipt.id, a.id): ns 244 for ipt in self.model_description.inputs 245 for a in ipt.axes 246 if isinstance(a.size, v0_5.ParameterizedSize) 247 } 248 input_block_shape = self.model_description.get_tensor_sizes( 249 ns, batch_size or self._default_batch_size 250 ).inputs 251 252 return self.predict_sample_with_fixed_blocking( 253 sample, 254 input_block_shape=input_block_shape, 255 skip_preprocessing=skip_preprocessing, 256 skip_postprocessing=skip_postprocessing, 257 )
predict a sample by splitting it into blocks according to the model and the ns
parameter
285 def apply_preprocessing(self, sample: Union[Sample, SampleBlockWithOrigin]) -> None: 286 """apply preprocessing in-place, also updates sample stats""" 287 for op in self._preprocessing: 288 op(sample)
apply preprocessing in-place, also updates sample stats
290 def apply_postprocessing( 291 self, sample: Union[Sample, SampleBlock, SampleBlockWithOrigin] 292 ) -> None: 293 """apply postprocessing in-place, also updates samples stats""" 294 for op in self._postprocessing: 295 if isinstance(sample, (Sample, SampleBlockWithOrigin)): 296 op(sample) 297 elif not isinstance(op, BlockedOperator): 298 raise NotImplementedError( 299 "block wise update of output statistics not yet implemented" 300 ) 301 else: 302 op(sample)
apply postprocessing in-place, also updates samples stats
46@dataclass 47class Sample: 48 """A dataset sample. 49 50 A `Sample` has `members`, which allows to combine multiple tensors into a single 51 sample. 52 For example a `Sample` from a dataset with masked images may contain a 53 `MemberId("raw")` and `MemberId("mask")` image. 54 """ 55 56 members: Dict[MemberId, Tensor] 57 """The sample's tensors""" 58 59 stat: Stat 60 """Sample and dataset statistics""" 61 62 id: SampleId 63 """Identifies the `Sample` within the dataset -- typically a number or a string.""" 64 65 @property 66 def shape(self) -> PerMember[PerAxis[int]]: 67 return {tid: t.sizes for tid, t in self.members.items()} 68 69 def as_arrays(self) -> Dict[str, NDArray[Any]]: 70 """Return sample as dictionary of arrays.""" 71 return {str(m): t.data.to_numpy() for m, t in self.members.items()} 72 73 def split_into_blocks( 74 self, 75 block_shapes: PerMember[PerAxis[int]], 76 halo: PerMember[PerAxis[HaloLike]], 77 pad_mode: PadMode, 78 broadcast: bool = False, 79 ) -> Tuple[TotalNumberOfBlocks, Iterable[SampleBlockWithOrigin]]: 80 assert not ( 81 missing := [m for m in block_shapes if m not in self.members] 82 ), f"`block_shapes` specified for unknown members: {missing}" 83 assert not ( 84 missing := [m for m in halo if m not in block_shapes] 85 ), f"`halo` specified for members without `block_shape`: {missing}" 86 87 n_blocks, blocks = split_multiple_shapes_into_blocks( 88 shapes=self.shape, 89 block_shapes=block_shapes, 90 halo=halo, 91 broadcast=broadcast, 92 ) 93 return n_blocks, sample_block_generator(blocks, origin=self, pad_mode=pad_mode) 94 95 def as_single_block(self, halo: Optional[PerMember[PerAxis[Halo]]] = None): 96 if halo is None: 97 halo = {} 98 return SampleBlockWithOrigin( 99 sample_shape=self.shape, 100 sample_id=self.id, 101 blocks={ 102 m: Block( 103 sample_shape=self.shape[m], 104 data=data, 105 inner_slice={ 106 a: SliceInfo(0, s) for a, s in data.tagged_shape.items() 107 }, 108 halo=halo.get(m, {}), 109 block_index=0, 110 blocks_in_sample=1, 111 ) 112 for m, data in self.members.items() 113 }, 114 stat=self.stat, 115 origin=self, 116 block_index=0, 117 blocks_in_sample=1, 118 ) 119 120 @classmethod 121 def from_blocks( 122 cls, 123 sample_blocks: Iterable[SampleBlock], 124 *, 125 fill_value: float = float("nan"), 126 ) -> Self: 127 members: PerMember[Tensor] = {} 128 stat: Stat = {} 129 sample_id = None 130 for sample_block in sample_blocks: 131 assert sample_id is None or sample_id == sample_block.sample_id 132 sample_id = sample_block.sample_id 133 stat = sample_block.stat 134 for m, block in sample_block.blocks.items(): 135 if m not in members: 136 if -1 in block.sample_shape.values(): 137 raise NotImplementedError( 138 "merging blocks with data dependent axis not yet implemented" 139 ) 140 141 members[m] = Tensor( 142 np.full( 143 tuple(block.sample_shape[a] for a in block.data.dims), 144 fill_value, 145 dtype=block.data.dtype, 146 ), 147 dims=block.data.dims, 148 ) 149 150 members[m][block.inner_slice] = block.inner_data 151 152 return cls(members=members, stat=stat, id=sample_id)
A dataset sample.
A Sample
has members
, which allows to combine multiple tensors into a single
sample.
For example a Sample
from a dataset with masked images may contain a
MemberId("raw")
and MemberId("mask")
image.
Sample and dataset statistics
69 def as_arrays(self) -> Dict[str, NDArray[Any]]: 70 """Return sample as dictionary of arrays.""" 71 return {str(m): t.data.to_numpy() for m, t in self.members.items()}
Return sample as dictionary of arrays.
73 def split_into_blocks( 74 self, 75 block_shapes: PerMember[PerAxis[int]], 76 halo: PerMember[PerAxis[HaloLike]], 77 pad_mode: PadMode, 78 broadcast: bool = False, 79 ) -> Tuple[TotalNumberOfBlocks, Iterable[SampleBlockWithOrigin]]: 80 assert not ( 81 missing := [m for m in block_shapes if m not in self.members] 82 ), f"`block_shapes` specified for unknown members: {missing}" 83 assert not ( 84 missing := [m for m in halo if m not in block_shapes] 85 ), f"`halo` specified for members without `block_shape`: {missing}" 86 87 n_blocks, blocks = split_multiple_shapes_into_blocks( 88 shapes=self.shape, 89 block_shapes=block_shapes, 90 halo=halo, 91 broadcast=broadcast, 92 ) 93 return n_blocks, sample_block_generator(blocks, origin=self, pad_mode=pad_mode)
95 def as_single_block(self, halo: Optional[PerMember[PerAxis[Halo]]] = None): 96 if halo is None: 97 halo = {} 98 return SampleBlockWithOrigin( 99 sample_shape=self.shape, 100 sample_id=self.id, 101 blocks={ 102 m: Block( 103 sample_shape=self.shape[m], 104 data=data, 105 inner_slice={ 106 a: SliceInfo(0, s) for a, s in data.tagged_shape.items() 107 }, 108 halo=halo.get(m, {}), 109 block_index=0, 110 blocks_in_sample=1, 111 ) 112 for m, data in self.members.items() 113 }, 114 stat=self.stat, 115 origin=self, 116 block_index=0, 117 blocks_in_sample=1, 118 )
120 @classmethod 121 def from_blocks( 122 cls, 123 sample_blocks: Iterable[SampleBlock], 124 *, 125 fill_value: float = float("nan"), 126 ) -> Self: 127 members: PerMember[Tensor] = {} 128 stat: Stat = {} 129 sample_id = None 130 for sample_block in sample_blocks: 131 assert sample_id is None or sample_id == sample_block.sample_id 132 sample_id = sample_block.sample_id 133 stat = sample_block.stat 134 for m, block in sample_block.blocks.items(): 135 if m not in members: 136 if -1 in block.sample_shape.values(): 137 raise NotImplementedError( 138 "merging blocks with data dependent axis not yet implemented" 139 ) 140 141 members[m] = Tensor( 142 np.full( 143 tuple(block.sample_shape[a] for a in block.data.dims), 144 fill_value, 145 dtype=block.data.dtype, 146 ), 147 dims=block.data.dims, 148 ) 149 150 members[m][block.inner_slice] = block.inner_data 151 152 return cls(members=members, stat=stat, id=sample_id)
150def save_bioimageio_package_as_folder( 151 source: Union[BioimageioYamlSource, ResourceDescr], 152 /, 153 *, 154 output_path: Union[NewPath, DirectoryPath, None] = None, 155 weights_priority_order: Optional[ # model only 156 Sequence[ 157 Literal[ 158 "keras_hdf5", 159 "onnx", 160 "pytorch_state_dict", 161 "tensorflow_js", 162 "tensorflow_saved_model_bundle", 163 "torchscript", 164 ] 165 ] 166 ] = None, 167) -> DirectoryPath: 168 """Write the content of a bioimage.io resource package to a folder. 169 170 Args: 171 source: bioimageio resource description 172 output_path: file path to write package to 173 weights_priority_order: If given only the first weights format present in the model is included. 174 If none of the prioritized weights formats is found all are included. 175 176 Returns: 177 directory path to bioimageio package folder 178 """ 179 package_content = _prepare_resource_package( 180 source, 181 weights_priority_order=weights_priority_order, 182 ) 183 if output_path is None: 184 output_path = Path(mkdtemp()) 185 else: 186 output_path = Path(output_path) 187 188 output_path.mkdir(exist_ok=True, parents=True) 189 for name, src in package_content.items(): 190 if isinstance(src, collections.abc.Mapping): 191 write_yaml(src, output_path / name) 192 else: 193 with (output_path / name).open("wb") as dest: 194 _ = shutil.copyfileobj(src, dest) 195 196 return output_path
Write the content of a bioimage.io resource package to a folder.
Arguments:
- source: bioimageio resource description
- output_path: file path to write package to
- weights_priority_order: If given only the first weights format present in the model is included. If none of the prioritized weights formats is found all are included.
Returns:
directory path to bioimageio package folder
199def save_bioimageio_package( 200 source: Union[BioimageioYamlSource, ResourceDescr], 201 /, 202 *, 203 compression: int = ZIP_DEFLATED, 204 compression_level: int = 1, 205 output_path: Union[NewPath, FilePath, None] = None, 206 weights_priority_order: Optional[ # model only 207 Sequence[ 208 Literal[ 209 "keras_hdf5", 210 "onnx", 211 "pytorch_state_dict", 212 "tensorflow_js", 213 "tensorflow_saved_model_bundle", 214 "torchscript", 215 ] 216 ] 217 ] = None, 218 allow_invalid: bool = False, 219) -> FilePath: 220 """Package a bioimageio resource as a zip file. 221 222 Args: 223 rd: bioimageio resource description 224 compression: The numeric constant of compression method. 225 compression_level: Compression level to use when writing files to the archive. 226 See https://docs.python.org/3/library/zipfile.html#zipfile.ZipFile 227 output_path: file path to write package to 228 weights_priority_order: If given only the first weights format present in the model is included. 229 If none of the prioritized weights formats is found all are included. 230 231 Returns: 232 path to zipped bioimageio package 233 """ 234 package_content = _prepare_resource_package( 235 source, 236 weights_priority_order=weights_priority_order, 237 ) 238 if output_path is None: 239 output_path = Path( 240 NamedTemporaryFile(suffix=".bioimageio.zip", delete=False).name 241 ) 242 else: 243 output_path = Path(output_path) 244 245 write_zip( 246 output_path, 247 package_content, 248 compression=compression, 249 compression_level=compression_level, 250 ) 251 with get_validation_context().replace(warning_level=ERROR): 252 if isinstance((exported := load_description(output_path)), InvalidDescr): 253 exported.validation_summary.display() 254 msg = f"Exported package at '{output_path}' is invalid." 255 if allow_invalid: 256 logger.error(msg) 257 else: 258 raise ValueError(msg) 259 260 return output_path
Package a bioimageio resource as a zip file.
Arguments:
- rd: bioimageio resource description
- compression: The numeric constant of compression method.
- compression_level: Compression level to use when writing files to the archive. See https://docs.python.org/3/library/zipfile.html#zipfile.ZipFile
- output_path: file path to write package to
- weights_priority_order: If given only the first weights format present in the model is included. If none of the prioritized weights formats is found all are included.
Returns:
path to zipped bioimageio package
202def save_bioimageio_yaml_only( 203 rd: Union[ResourceDescr, BioimageioYamlContent, InvalidDescr], 204 /, 205 file: Union[NewPath, FilePath, TextIO], 206 *, 207 exclude_unset: bool = True, 208 exclude_defaults: bool = False, 209): 210 """write the metadata of a resource description (`rd`) to `file` 211 without writing any of the referenced files in it. 212 213 Args: 214 rd: bioimageio resource description 215 file: file or stream to save to 216 exclude_unset: Exclude fields that have not explicitly be set. 217 exclude_defaults: Exclude fields that have the default value (even if set explicitly). 218 219 Note: To save a resource description with its associated files as a package, 220 use `save_bioimageio_package` or `save_bioimageio_package_as_folder`. 221 """ 222 if isinstance(rd, ResourceDescrBase): 223 content = dump_description( 224 rd, exclude_unset=exclude_unset, exclude_defaults=exclude_defaults 225 ) 226 else: 227 content = rd 228 229 write_yaml(cast(YamlValue, content), file)
write the metadata of a resource description (rd
) to file
without writing any of the referenced files in it.
Arguments:
- rd: bioimageio resource description
- file: file or stream to save to
- exclude_unset: Exclude fields that have not explicitly be set.
- exclude_defaults: Exclude fields that have the default value (even if set explicitly).
Note: To save a resource description with its associated files as a package,
use save_bioimageio_package
or save_bioimageio_package_as_folder
.
49class Tensor(MagicTensorOpsMixin): 50 """A wrapper around an xr.DataArray for better integration with bioimageio.spec 51 and improved type annotations.""" 52 53 _Compatible = Union["Tensor", xr.DataArray, _ScalarOrArray] 54 55 def __init__( 56 self, 57 array: NDArray[Any], 58 dims: Sequence[Union[AxisId, AxisLike]], 59 ) -> None: 60 super().__init__() 61 axes = tuple( 62 a if isinstance(a, AxisId) else AxisInfo.create(a).id for a in dims 63 ) 64 self._data = xr.DataArray(array, dims=axes) 65 66 def __array__(self, dtype: DTypeLike = None): 67 return np.asarray(self._data, dtype=dtype) 68 69 def __getitem__( 70 self, 71 key: Union[ 72 SliceInfo, 73 slice, 74 int, 75 PerAxis[Union[SliceInfo, slice, int]], 76 Tensor, 77 xr.DataArray, 78 ], 79 ) -> Self: 80 if isinstance(key, SliceInfo): 81 key = slice(*key) 82 elif isinstance(key, collections.abc.Mapping): 83 key = { 84 a: s if isinstance(s, int) else s if isinstance(s, slice) else slice(*s) 85 for a, s in key.items() 86 } 87 elif isinstance(key, Tensor): 88 key = key._data 89 90 return self.__class__.from_xarray(self._data[key]) 91 92 def __setitem__( 93 self, 94 key: Union[PerAxis[Union[SliceInfo, slice]], Tensor, xr.DataArray], 95 value: Union[Tensor, xr.DataArray, float, int], 96 ) -> None: 97 if isinstance(key, Tensor): 98 key = key._data 99 elif isinstance(key, xr.DataArray): 100 pass 101 else: 102 key = {a: s if isinstance(s, slice) else slice(*s) for a, s in key.items()} 103 104 if isinstance(value, Tensor): 105 value = value._data 106 107 self._data[key] = value 108 109 def __len__(self) -> int: 110 return len(self.data) 111 112 def _iter(self: Any) -> Iterator[Any]: 113 for n in range(len(self)): 114 yield self[n] 115 116 def __iter__(self: Any) -> Iterator[Any]: 117 if self.ndim == 0: 118 raise TypeError("iteration over a 0-d array") 119 return self._iter() 120 121 def _binary_op( 122 self, 123 other: _Compatible, 124 f: Callable[[Any, Any], Any], 125 reflexive: bool = False, 126 ) -> Self: 127 data = self._data._binary_op( # pyright: ignore[reportPrivateUsage] 128 (other._data if isinstance(other, Tensor) else other), 129 f, 130 reflexive, 131 ) 132 return self.__class__.from_xarray(data) 133 134 def _inplace_binary_op( 135 self, 136 other: _Compatible, 137 f: Callable[[Any, Any], Any], 138 ) -> Self: 139 _ = self._data._inplace_binary_op( # pyright: ignore[reportPrivateUsage] 140 ( 141 other_d 142 if (other_d := getattr(other, "data")) is not None 143 and isinstance( 144 other_d, 145 xr.DataArray, 146 ) 147 else other 148 ), 149 f, 150 ) 151 return self 152 153 def _unary_op(self, f: Callable[[Any], Any], *args: Any, **kwargs: Any) -> Self: 154 data = self._data._unary_op( # pyright: ignore[reportPrivateUsage] 155 f, *args, **kwargs 156 ) 157 return self.__class__.from_xarray(data) 158 159 @classmethod 160 def from_xarray(cls, data_array: xr.DataArray) -> Self: 161 """create a `Tensor` from an xarray data array 162 163 note for internal use: this factory method is round-trip save 164 for any `Tensor`'s `data` property (an xarray.DataArray). 165 """ 166 return cls( 167 array=data_array.data, dims=tuple(AxisId(d) for d in data_array.dims) 168 ) 169 170 @classmethod 171 def from_numpy( 172 cls, 173 array: NDArray[Any], 174 *, 175 dims: Optional[Union[AxisLike, Sequence[AxisLike]]], 176 ) -> Tensor: 177 """create a `Tensor` from a numpy array 178 179 Args: 180 array: the nd numpy array 181 axes: A description of the array's axes, 182 if None axes are guessed (which might fail and raise a ValueError.) 183 184 Raises: 185 ValueError: if `axes` is None and axes guessing fails. 186 """ 187 188 if dims is None: 189 return cls._interprete_array_wo_known_axes(array) 190 elif isinstance(dims, (str, Axis, v0_5.AxisBase)): 191 dims = [dims] 192 193 axis_infos = [AxisInfo.create(a) for a in dims] 194 original_shape = tuple(array.shape) 195 196 successful_view = _get_array_view(array, axis_infos) 197 if successful_view is None: 198 raise ValueError( 199 f"Array shape {original_shape} does not map to axes {dims}" 200 ) 201 202 return Tensor(successful_view, dims=tuple(a.id for a in axis_infos)) 203 204 @property 205 def data(self): 206 return self._data 207 208 @property 209 def dims(self): # TODO: rename to `axes`? 210 """Tuple of dimension names associated with this tensor.""" 211 return cast(Tuple[AxisId, ...], self._data.dims) 212 213 @property 214 def dtype(self) -> DTypeStr: 215 dt = str(self.data.dtype) # pyright: ignore[reportUnknownArgumentType] 216 assert dt in get_args(DTypeStr) 217 return dt # pyright: ignore[reportReturnType] 218 219 @property 220 def ndim(self): 221 """Number of tensor dimensions.""" 222 return self._data.ndim 223 224 @property 225 def shape(self): 226 """Tuple of tensor axes lengths""" 227 return self._data.shape 228 229 @property 230 def shape_tuple(self): 231 """Tuple of tensor axes lengths""" 232 return self._data.shape 233 234 @property 235 def size(self): 236 """Number of elements in the tensor. 237 238 Equal to math.prod(tensor.shape), i.e., the product of the tensors’ dimensions. 239 """ 240 return self._data.size 241 242 @property 243 def sizes(self): 244 """Ordered, immutable mapping from axis ids to axis lengths.""" 245 return cast(Mapping[AxisId, int], self.data.sizes) 246 247 @property 248 def tagged_shape(self): 249 """(alias for `sizes`) Ordered, immutable mapping from axis ids to lengths.""" 250 return self.sizes 251 252 def argmax(self) -> Mapping[AxisId, int]: 253 ret = self._data.argmax(...) 254 assert isinstance(ret, dict) 255 return {cast(AxisId, k): cast(int, v.item()) for k, v in ret.items()} 256 257 def astype(self, dtype: DTypeStr, *, copy: bool = False): 258 """Return tensor cast to `dtype` 259 260 note: if dtype is already satisfied copy if `copy`""" 261 return self.__class__.from_xarray(self._data.astype(dtype, copy=copy)) 262 263 def clip(self, min: Optional[float] = None, max: Optional[float] = None): 264 """Return a tensor whose values are limited to [min, max]. 265 At least one of max or min must be given.""" 266 return self.__class__.from_xarray(self._data.clip(min, max)) 267 268 def crop_to( 269 self, 270 sizes: PerAxis[int], 271 crop_where: Union[ 272 CropWhere, 273 PerAxis[CropWhere], 274 ] = "left_and_right", 275 ) -> Self: 276 """crop to match `sizes`""" 277 if isinstance(crop_where, str): 278 crop_axis_where: PerAxis[CropWhere] = {a: crop_where for a in self.dims} 279 else: 280 crop_axis_where = crop_where 281 282 slices: Dict[AxisId, SliceInfo] = {} 283 284 for a, s_is in self.sizes.items(): 285 if a not in sizes or sizes[a] == s_is: 286 pass 287 elif sizes[a] > s_is: 288 logger.warning( 289 "Cannot crop axis {} of size {} to larger size {}", 290 a, 291 s_is, 292 sizes[a], 293 ) 294 elif a not in crop_axis_where: 295 raise ValueError( 296 f"Don't know where to crop axis {a}, `crop_where`={crop_where}" 297 ) 298 else: 299 crop_this_axis_where = crop_axis_where[a] 300 if crop_this_axis_where == "left": 301 slices[a] = SliceInfo(s_is - sizes[a], s_is) 302 elif crop_this_axis_where == "right": 303 slices[a] = SliceInfo(0, sizes[a]) 304 elif crop_this_axis_where == "left_and_right": 305 slices[a] = SliceInfo( 306 start := (s_is - sizes[a]) // 2, sizes[a] + start 307 ) 308 else: 309 assert_never(crop_this_axis_where) 310 311 return self[slices] 312 313 def expand_dims(self, dims: Union[Sequence[AxisId], PerAxis[int]]) -> Self: 314 return self.__class__.from_xarray(self._data.expand_dims(dims=dims)) 315 316 def item( 317 self, 318 key: Union[ 319 None, SliceInfo, slice, int, PerAxis[Union[SliceInfo, slice, int]] 320 ] = None, 321 ): 322 """Copy a tensor element to a standard Python scalar and return it.""" 323 if key is None: 324 ret = self._data.item() 325 else: 326 ret = self[key]._data.item() 327 328 assert isinstance(ret, (bool, float, int)) 329 return ret 330 331 def mean(self, dim: Optional[Union[AxisId, Sequence[AxisId]]] = None) -> Self: 332 return self.__class__.from_xarray(self._data.mean(dim=dim)) 333 334 def pad( 335 self, 336 pad_width: PerAxis[PadWidthLike], 337 mode: PadMode = "symmetric", 338 ) -> Self: 339 pad_width = {a: PadWidth.create(p) for a, p in pad_width.items()} 340 return self.__class__.from_xarray( 341 self._data.pad(pad_width=pad_width, mode=mode) 342 ) 343 344 def pad_to( 345 self, 346 sizes: PerAxis[int], 347 pad_where: Union[PadWhere, PerAxis[PadWhere]] = "left_and_right", 348 mode: PadMode = "symmetric", 349 ) -> Self: 350 """pad `tensor` to match `sizes`""" 351 if isinstance(pad_where, str): 352 pad_axis_where: PerAxis[PadWhere] = {a: pad_where for a in self.dims} 353 else: 354 pad_axis_where = pad_where 355 356 pad_width: Dict[AxisId, PadWidth] = {} 357 for a, s_is in self.sizes.items(): 358 if a not in sizes or sizes[a] == s_is: 359 pad_width[a] = PadWidth(0, 0) 360 elif s_is > sizes[a]: 361 pad_width[a] = PadWidth(0, 0) 362 logger.warning( 363 "Cannot pad axis {} of size {} to smaller size {}", 364 a, 365 s_is, 366 sizes[a], 367 ) 368 elif a not in pad_axis_where: 369 raise ValueError( 370 f"Don't know where to pad axis {a}, `pad_where`={pad_where}" 371 ) 372 else: 373 pad_this_axis_where = pad_axis_where[a] 374 d = sizes[a] - s_is 375 if pad_this_axis_where == "left": 376 pad_width[a] = PadWidth(d, 0) 377 elif pad_this_axis_where == "right": 378 pad_width[a] = PadWidth(0, d) 379 elif pad_this_axis_where == "left_and_right": 380 pad_width[a] = PadWidth(left := d // 2, d - left) 381 else: 382 assert_never(pad_this_axis_where) 383 384 return self.pad(pad_width, mode) 385 386 def quantile( 387 self, 388 q: Union[float, Sequence[float]], 389 dim: Optional[Union[AxisId, Sequence[AxisId]]] = None, 390 ) -> Self: 391 assert ( 392 isinstance(q, (float, int)) 393 and q >= 0.0 394 or not isinstance(q, (float, int)) 395 and all(qq >= 0.0 for qq in q) 396 ) 397 assert ( 398 isinstance(q, (float, int)) 399 and q <= 1.0 400 or not isinstance(q, (float, int)) 401 and all(qq <= 1.0 for qq in q) 402 ) 403 assert dim is None or ( 404 (quantile_dim := AxisId("quantile")) != dim and quantile_dim not in set(dim) 405 ) 406 return self.__class__.from_xarray(self._data.quantile(q, dim=dim)) 407 408 def resize_to( 409 self, 410 sizes: PerAxis[int], 411 *, 412 pad_where: Union[ 413 PadWhere, 414 PerAxis[PadWhere], 415 ] = "left_and_right", 416 crop_where: Union[ 417 CropWhere, 418 PerAxis[CropWhere], 419 ] = "left_and_right", 420 pad_mode: PadMode = "symmetric", 421 ): 422 """return cropped/padded tensor with `sizes`""" 423 crop_to_sizes: Dict[AxisId, int] = {} 424 pad_to_sizes: Dict[AxisId, int] = {} 425 new_axes = dict(sizes) 426 for a, s_is in self.sizes.items(): 427 a = AxisId(str(a)) 428 _ = new_axes.pop(a, None) 429 if a not in sizes or sizes[a] == s_is: 430 pass 431 elif s_is > sizes[a]: 432 crop_to_sizes[a] = sizes[a] 433 else: 434 pad_to_sizes[a] = sizes[a] 435 436 tensor = self 437 if crop_to_sizes: 438 tensor = tensor.crop_to(crop_to_sizes, crop_where=crop_where) 439 440 if pad_to_sizes: 441 tensor = tensor.pad_to(pad_to_sizes, pad_where=pad_where, mode=pad_mode) 442 443 if new_axes: 444 tensor = tensor.expand_dims(new_axes) 445 446 return tensor 447 448 def std(self, dim: Optional[Union[AxisId, Sequence[AxisId]]] = None) -> Self: 449 return self.__class__.from_xarray(self._data.std(dim=dim)) 450 451 def sum(self, dim: Optional[Union[AxisId, Sequence[AxisId]]] = None) -> Self: 452 """Reduce this Tensor's data by applying sum along some dimension(s).""" 453 return self.__class__.from_xarray(self._data.sum(dim=dim)) 454 455 def transpose( 456 self, 457 axes: Sequence[AxisId], 458 ) -> Self: 459 """return a transposed tensor 460 461 Args: 462 axes: the desired tensor axes 463 """ 464 # expand missing tensor axes 465 missing_axes = tuple(a for a in axes if a not in self.dims) 466 array = self._data 467 if missing_axes: 468 array = array.expand_dims(missing_axes) 469 470 # transpose to the correct axis order 471 return self.__class__.from_xarray(array.transpose(*axes)) 472 473 def var(self, dim: Optional[Union[AxisId, Sequence[AxisId]]] = None) -> Self: 474 return self.__class__.from_xarray(self._data.var(dim=dim)) 475 476 @classmethod 477 def _interprete_array_wo_known_axes(cls, array: NDArray[Any]): 478 ndim = array.ndim 479 if ndim == 2: 480 current_axes = ( 481 v0_5.SpaceInputAxis(id=AxisId("y"), size=array.shape[0]), 482 v0_5.SpaceInputAxis(id=AxisId("x"), size=array.shape[1]), 483 ) 484 elif ndim == 3 and any(s <= 3 for s in array.shape): 485 current_axes = ( 486 v0_5.ChannelAxis( 487 channel_names=[ 488 v0_5.Identifier(f"channel{i}") for i in range(array.shape[0]) 489 ] 490 ), 491 v0_5.SpaceInputAxis(id=AxisId("y"), size=array.shape[1]), 492 v0_5.SpaceInputAxis(id=AxisId("x"), size=array.shape[2]), 493 ) 494 elif ndim == 3: 495 current_axes = ( 496 v0_5.SpaceInputAxis(id=AxisId("z"), size=array.shape[0]), 497 v0_5.SpaceInputAxis(id=AxisId("y"), size=array.shape[1]), 498 v0_5.SpaceInputAxis(id=AxisId("x"), size=array.shape[2]), 499 ) 500 elif ndim == 4: 501 current_axes = ( 502 v0_5.ChannelAxis( 503 channel_names=[ 504 v0_5.Identifier(f"channel{i}") for i in range(array.shape[0]) 505 ] 506 ), 507 v0_5.SpaceInputAxis(id=AxisId("z"), size=array.shape[1]), 508 v0_5.SpaceInputAxis(id=AxisId("y"), size=array.shape[2]), 509 v0_5.SpaceInputAxis(id=AxisId("x"), size=array.shape[3]), 510 ) 511 elif ndim == 5: 512 current_axes = ( 513 v0_5.BatchAxis(), 514 v0_5.ChannelAxis( 515 channel_names=[ 516 v0_5.Identifier(f"channel{i}") for i in range(array.shape[1]) 517 ] 518 ), 519 v0_5.SpaceInputAxis(id=AxisId("z"), size=array.shape[2]), 520 v0_5.SpaceInputAxis(id=AxisId("y"), size=array.shape[3]), 521 v0_5.SpaceInputAxis(id=AxisId("x"), size=array.shape[4]), 522 ) 523 else: 524 raise ValueError(f"Could not guess an axis mapping for {array.shape}") 525 526 return cls(array, dims=tuple(a.id for a in current_axes))
A wrapper around an xr.DataArray for better integration with bioimageio.spec and improved type annotations.
159 @classmethod 160 def from_xarray(cls, data_array: xr.DataArray) -> Self: 161 """create a `Tensor` from an xarray data array 162 163 note for internal use: this factory method is round-trip save 164 for any `Tensor`'s `data` property (an xarray.DataArray). 165 """ 166 return cls( 167 array=data_array.data, dims=tuple(AxisId(d) for d in data_array.dims) 168 )
170 @classmethod 171 def from_numpy( 172 cls, 173 array: NDArray[Any], 174 *, 175 dims: Optional[Union[AxisLike, Sequence[AxisLike]]], 176 ) -> Tensor: 177 """create a `Tensor` from a numpy array 178 179 Args: 180 array: the nd numpy array 181 axes: A description of the array's axes, 182 if None axes are guessed (which might fail and raise a ValueError.) 183 184 Raises: 185 ValueError: if `axes` is None and axes guessing fails. 186 """ 187 188 if dims is None: 189 return cls._interprete_array_wo_known_axes(array) 190 elif isinstance(dims, (str, Axis, v0_5.AxisBase)): 191 dims = [dims] 192 193 axis_infos = [AxisInfo.create(a) for a in dims] 194 original_shape = tuple(array.shape) 195 196 successful_view = _get_array_view(array, axis_infos) 197 if successful_view is None: 198 raise ValueError( 199 f"Array shape {original_shape} does not map to axes {dims}" 200 ) 201 202 return Tensor(successful_view, dims=tuple(a.id for a in axis_infos))
create a Tensor
from a numpy array
Arguments:
- array: the nd numpy array
- axes: A description of the array's axes, if None axes are guessed (which might fail and raise a ValueError.)
Raises:
- ValueError: if
axes
is None and axes guessing fails.
208 @property 209 def dims(self): # TODO: rename to `axes`? 210 """Tuple of dimension names associated with this tensor.""" 211 return cast(Tuple[AxisId, ...], self._data.dims)
Tuple of dimension names associated with this tensor.
224 @property 225 def shape(self): 226 """Tuple of tensor axes lengths""" 227 return self._data.shape
Tuple of tensor axes lengths
229 @property 230 def shape_tuple(self): 231 """Tuple of tensor axes lengths""" 232 return self._data.shape
Tuple of tensor axes lengths
234 @property 235 def size(self): 236 """Number of elements in the tensor. 237 238 Equal to math.prod(tensor.shape), i.e., the product of the tensors’ dimensions. 239 """ 240 return self._data.size
Number of elements in the tensor.
Equal to math.prod(tensor.shape), i.e., the product of the tensors’ dimensions.
242 @property 243 def sizes(self): 244 """Ordered, immutable mapping from axis ids to axis lengths.""" 245 return cast(Mapping[AxisId, int], self.data.sizes)
Ordered, immutable mapping from axis ids to axis lengths.
247 @property 248 def tagged_shape(self): 249 """(alias for `sizes`) Ordered, immutable mapping from axis ids to lengths.""" 250 return self.sizes
(alias for sizes
) Ordered, immutable mapping from axis ids to lengths.
257 def astype(self, dtype: DTypeStr, *, copy: bool = False): 258 """Return tensor cast to `dtype` 259 260 note: if dtype is already satisfied copy if `copy`""" 261 return self.__class__.from_xarray(self._data.astype(dtype, copy=copy))
Return tensor cast to dtype
note: if dtype is already satisfied copy if copy
263 def clip(self, min: Optional[float] = None, max: Optional[float] = None): 264 """Return a tensor whose values are limited to [min, max]. 265 At least one of max or min must be given.""" 266 return self.__class__.from_xarray(self._data.clip(min, max))
Return a tensor whose values are limited to [min, max]. At least one of max or min must be given.
268 def crop_to( 269 self, 270 sizes: PerAxis[int], 271 crop_where: Union[ 272 CropWhere, 273 PerAxis[CropWhere], 274 ] = "left_and_right", 275 ) -> Self: 276 """crop to match `sizes`""" 277 if isinstance(crop_where, str): 278 crop_axis_where: PerAxis[CropWhere] = {a: crop_where for a in self.dims} 279 else: 280 crop_axis_where = crop_where 281 282 slices: Dict[AxisId, SliceInfo] = {} 283 284 for a, s_is in self.sizes.items(): 285 if a not in sizes or sizes[a] == s_is: 286 pass 287 elif sizes[a] > s_is: 288 logger.warning( 289 "Cannot crop axis {} of size {} to larger size {}", 290 a, 291 s_is, 292 sizes[a], 293 ) 294 elif a not in crop_axis_where: 295 raise ValueError( 296 f"Don't know where to crop axis {a}, `crop_where`={crop_where}" 297 ) 298 else: 299 crop_this_axis_where = crop_axis_where[a] 300 if crop_this_axis_where == "left": 301 slices[a] = SliceInfo(s_is - sizes[a], s_is) 302 elif crop_this_axis_where == "right": 303 slices[a] = SliceInfo(0, sizes[a]) 304 elif crop_this_axis_where == "left_and_right": 305 slices[a] = SliceInfo( 306 start := (s_is - sizes[a]) // 2, sizes[a] + start 307 ) 308 else: 309 assert_never(crop_this_axis_where) 310 311 return self[slices]
crop to match sizes
316 def item( 317 self, 318 key: Union[ 319 None, SliceInfo, slice, int, PerAxis[Union[SliceInfo, slice, int]] 320 ] = None, 321 ): 322 """Copy a tensor element to a standard Python scalar and return it.""" 323 if key is None: 324 ret = self._data.item() 325 else: 326 ret = self[key]._data.item() 327 328 assert isinstance(ret, (bool, float, int)) 329 return ret
Copy a tensor element to a standard Python scalar and return it.
344 def pad_to( 345 self, 346 sizes: PerAxis[int], 347 pad_where: Union[PadWhere, PerAxis[PadWhere]] = "left_and_right", 348 mode: PadMode = "symmetric", 349 ) -> Self: 350 """pad `tensor` to match `sizes`""" 351 if isinstance(pad_where, str): 352 pad_axis_where: PerAxis[PadWhere] = {a: pad_where for a in self.dims} 353 else: 354 pad_axis_where = pad_where 355 356 pad_width: Dict[AxisId, PadWidth] = {} 357 for a, s_is in self.sizes.items(): 358 if a not in sizes or sizes[a] == s_is: 359 pad_width[a] = PadWidth(0, 0) 360 elif s_is > sizes[a]: 361 pad_width[a] = PadWidth(0, 0) 362 logger.warning( 363 "Cannot pad axis {} of size {} to smaller size {}", 364 a, 365 s_is, 366 sizes[a], 367 ) 368 elif a not in pad_axis_where: 369 raise ValueError( 370 f"Don't know where to pad axis {a}, `pad_where`={pad_where}" 371 ) 372 else: 373 pad_this_axis_where = pad_axis_where[a] 374 d = sizes[a] - s_is 375 if pad_this_axis_where == "left": 376 pad_width[a] = PadWidth(d, 0) 377 elif pad_this_axis_where == "right": 378 pad_width[a] = PadWidth(0, d) 379 elif pad_this_axis_where == "left_and_right": 380 pad_width[a] = PadWidth(left := d // 2, d - left) 381 else: 382 assert_never(pad_this_axis_where) 383 384 return self.pad(pad_width, mode)
pad tensor
to match sizes
386 def quantile( 387 self, 388 q: Union[float, Sequence[float]], 389 dim: Optional[Union[AxisId, Sequence[AxisId]]] = None, 390 ) -> Self: 391 assert ( 392 isinstance(q, (float, int)) 393 and q >= 0.0 394 or not isinstance(q, (float, int)) 395 and all(qq >= 0.0 for qq in q) 396 ) 397 assert ( 398 isinstance(q, (float, int)) 399 and q <= 1.0 400 or not isinstance(q, (float, int)) 401 and all(qq <= 1.0 for qq in q) 402 ) 403 assert dim is None or ( 404 (quantile_dim := AxisId("quantile")) != dim and quantile_dim not in set(dim) 405 ) 406 return self.__class__.from_xarray(self._data.quantile(q, dim=dim))
408 def resize_to( 409 self, 410 sizes: PerAxis[int], 411 *, 412 pad_where: Union[ 413 PadWhere, 414 PerAxis[PadWhere], 415 ] = "left_and_right", 416 crop_where: Union[ 417 CropWhere, 418 PerAxis[CropWhere], 419 ] = "left_and_right", 420 pad_mode: PadMode = "symmetric", 421 ): 422 """return cropped/padded tensor with `sizes`""" 423 crop_to_sizes: Dict[AxisId, int] = {} 424 pad_to_sizes: Dict[AxisId, int] = {} 425 new_axes = dict(sizes) 426 for a, s_is in self.sizes.items(): 427 a = AxisId(str(a)) 428 _ = new_axes.pop(a, None) 429 if a not in sizes or sizes[a] == s_is: 430 pass 431 elif s_is > sizes[a]: 432 crop_to_sizes[a] = sizes[a] 433 else: 434 pad_to_sizes[a] = sizes[a] 435 436 tensor = self 437 if crop_to_sizes: 438 tensor = tensor.crop_to(crop_to_sizes, crop_where=crop_where) 439 440 if pad_to_sizes: 441 tensor = tensor.pad_to(pad_to_sizes, pad_where=pad_where, mode=pad_mode) 442 443 if new_axes: 444 tensor = tensor.expand_dims(new_axes) 445 446 return tensor
return cropped/padded tensor with sizes
451 def sum(self, dim: Optional[Union[AxisId, Sequence[AxisId]]] = None) -> Self: 452 """Reduce this Tensor's data by applying sum along some dimension(s).""" 453 return self.__class__.from_xarray(self._data.sum(dim=dim))
Reduce this Tensor's data by applying sum along some dimension(s).
455 def transpose( 456 self, 457 axes: Sequence[AxisId], 458 ) -> Self: 459 """return a transposed tensor 460 461 Args: 462 axes: the desired tensor axes 463 """ 464 # expand missing tensor axes 465 missing_axes = tuple(a for a in axes if a not in self.dims) 466 array = self._data 467 if missing_axes: 468 array = array.expand_dims(missing_axes) 469 470 # transpose to the correct axis order 471 return self.__class__.from_xarray(array.transpose(*axes))
return a transposed tensor
Arguments:
- axes: the desired tensor axes
190def test_description( 191 source: Union[ResourceDescr, PermissiveFileSource, BioimageioYamlContent], 192 *, 193 format_version: Union[FormatVersionPlaceholder, str] = "discover", 194 weight_format: Optional[SupportedWeightsFormat] = None, 195 devices: Optional[Sequence[str]] = None, 196 determinism: Literal["seed_only", "full"] = "seed_only", 197 expected_type: Optional[str] = None, 198 sha256: Optional[Sha256] = None, 199 stop_early: bool = False, 200 runtime_env: Union[ 201 Literal["currently-active", "as-described"], Path, BioimageioCondaEnv 202 ] = ("currently-active"), 203 run_command: Callable[[Sequence[str]], None] = default_run_command, 204 **deprecated: Unpack[DeprecatedKwargs], 205) -> ValidationSummary: 206 """Test a bioimage.io resource dynamically, 207 for example run prediction of test tensors for models. 208 209 Args: 210 source: model description source. 211 weight_format: Weight format to test. 212 Default: All weight formats present in **source**. 213 devices: Devices to test with, e.g. 'cpu', 'cuda'. 214 Default (may be weight format dependent): ['cuda'] if available, ['cpu'] otherwise. 215 determinism: Modes to improve reproducibility of test outputs. 216 expected_type: Assert an expected resource description `type`. 217 sha256: Expected SHA256 value of **source**. 218 (Ignored if **source** already is a loaded `ResourceDescr` object.) 219 stop_early: Do not run further subtests after a failed one. 220 runtime_env: (Experimental feature!) The Python environment to run the tests in 221 - `"currently-active"`: Use active Python interpreter. 222 - `"as-described"`: Use `bioimageio.spec.get_conda_env` to generate a conda 223 environment YAML file based on the model weights description. 224 - A `BioimageioCondaEnv` or a path to a conda environment YAML file. 225 Note: The `bioimageio.core` dependency will be added automatically if not present. 226 run_command: (Experimental feature!) Function to execute (conda) terminal commands in a subprocess 227 (ignored if **runtime_env** is `"currently-active"`). 228 """ 229 if runtime_env == "currently-active": 230 rd = load_description_and_test( 231 source, 232 format_version=format_version, 233 weight_format=weight_format, 234 devices=devices, 235 determinism=determinism, 236 expected_type=expected_type, 237 sha256=sha256, 238 stop_early=stop_early, 239 **deprecated, 240 ) 241 return rd.validation_summary 242 243 if runtime_env == "as-described": 244 conda_env = None 245 elif isinstance(runtime_env, (str, Path)): 246 conda_env = BioimageioCondaEnv.model_validate(read_yaml(Path(runtime_env))) 247 elif isinstance(runtime_env, BioimageioCondaEnv): 248 conda_env = runtime_env 249 else: 250 assert_never(runtime_env) 251 252 with TemporaryDirectory(ignore_cleanup_errors=True) as _d: 253 working_dir = Path(_d) 254 if isinstance(source, (dict, ResourceDescrBase)): 255 file_source = save_bioimageio_package( 256 source, output_path=working_dir / "package.zip" 257 ) 258 else: 259 file_source = source 260 261 return _test_in_env( 262 file_source, 263 working_dir=working_dir, 264 weight_format=weight_format, 265 conda_env=conda_env, 266 devices=devices, 267 determinism=determinism, 268 expected_type=expected_type, 269 sha256=sha256, 270 stop_early=stop_early, 271 run_command=run_command, 272 **deprecated, 273 )
Test a bioimage.io resource dynamically, for example run prediction of test tensors for models.
Arguments:
- source: model description source.
- weight_format: Weight format to test. Default: All weight formats present in source.
- devices: Devices to test with, e.g. 'cpu', 'cuda'. Default (may be weight format dependent): ['cuda'] if available, ['cpu'] otherwise.
- determinism: Modes to improve reproducibility of test outputs.
- expected_type: Assert an expected resource description
type
. - sha256: Expected SHA256 value of source.
(Ignored if source already is a loaded
ResourceDescr
object.) - stop_early: Do not run further subtests after a failed one.
- runtime_env: (Experimental feature!) The Python environment to run the tests in
"currently-active"
: Use active Python interpreter."as-described"
: Usebioimageio.spec.get_conda_env
to generate a conda environment YAML file based on the model weights description.- A
BioimageioCondaEnv
or a path to a conda environment YAML file. Note: Thebioimageio.core
dependency will be added automatically if not present.
- run_command: (Experimental feature!) Function to execute (conda) terminal commands in a subprocess
(ignored if runtime_env is
"currently-active"
).
162def test_model( 163 source: Union[v0_4.ModelDescr, v0_5.ModelDescr, PermissiveFileSource], 164 weight_format: Optional[SupportedWeightsFormat] = None, 165 devices: Optional[List[str]] = None, 166 *, 167 determinism: Literal["seed_only", "full"] = "seed_only", 168 sha256: Optional[Sha256] = None, 169 stop_early: bool = False, 170 **deprecated: Unpack[DeprecatedKwargs], 171) -> ValidationSummary: 172 """Test model inference""" 173 return test_description( 174 source, 175 weight_format=weight_format, 176 devices=devices, 177 determinism=determinism, 178 expected_type="model", 179 sha256=sha256, 180 stop_early=stop_early, 181 **deprecated, 182 )
Test model inference
190def test_description( 191 source: Union[ResourceDescr, PermissiveFileSource, BioimageioYamlContent], 192 *, 193 format_version: Union[FormatVersionPlaceholder, str] = "discover", 194 weight_format: Optional[SupportedWeightsFormat] = None, 195 devices: Optional[Sequence[str]] = None, 196 determinism: Literal["seed_only", "full"] = "seed_only", 197 expected_type: Optional[str] = None, 198 sha256: Optional[Sha256] = None, 199 stop_early: bool = False, 200 runtime_env: Union[ 201 Literal["currently-active", "as-described"], Path, BioimageioCondaEnv 202 ] = ("currently-active"), 203 run_command: Callable[[Sequence[str]], None] = default_run_command, 204 **deprecated: Unpack[DeprecatedKwargs], 205) -> ValidationSummary: 206 """Test a bioimage.io resource dynamically, 207 for example run prediction of test tensors for models. 208 209 Args: 210 source: model description source. 211 weight_format: Weight format to test. 212 Default: All weight formats present in **source**. 213 devices: Devices to test with, e.g. 'cpu', 'cuda'. 214 Default (may be weight format dependent): ['cuda'] if available, ['cpu'] otherwise. 215 determinism: Modes to improve reproducibility of test outputs. 216 expected_type: Assert an expected resource description `type`. 217 sha256: Expected SHA256 value of **source**. 218 (Ignored if **source** already is a loaded `ResourceDescr` object.) 219 stop_early: Do not run further subtests after a failed one. 220 runtime_env: (Experimental feature!) The Python environment to run the tests in 221 - `"currently-active"`: Use active Python interpreter. 222 - `"as-described"`: Use `bioimageio.spec.get_conda_env` to generate a conda 223 environment YAML file based on the model weights description. 224 - A `BioimageioCondaEnv` or a path to a conda environment YAML file. 225 Note: The `bioimageio.core` dependency will be added automatically if not present. 226 run_command: (Experimental feature!) Function to execute (conda) terminal commands in a subprocess 227 (ignored if **runtime_env** is `"currently-active"`). 228 """ 229 if runtime_env == "currently-active": 230 rd = load_description_and_test( 231 source, 232 format_version=format_version, 233 weight_format=weight_format, 234 devices=devices, 235 determinism=determinism, 236 expected_type=expected_type, 237 sha256=sha256, 238 stop_early=stop_early, 239 **deprecated, 240 ) 241 return rd.validation_summary 242 243 if runtime_env == "as-described": 244 conda_env = None 245 elif isinstance(runtime_env, (str, Path)): 246 conda_env = BioimageioCondaEnv.model_validate(read_yaml(Path(runtime_env))) 247 elif isinstance(runtime_env, BioimageioCondaEnv): 248 conda_env = runtime_env 249 else: 250 assert_never(runtime_env) 251 252 with TemporaryDirectory(ignore_cleanup_errors=True) as _d: 253 working_dir = Path(_d) 254 if isinstance(source, (dict, ResourceDescrBase)): 255 file_source = save_bioimageio_package( 256 source, output_path=working_dir / "package.zip" 257 ) 258 else: 259 file_source = source 260 261 return _test_in_env( 262 file_source, 263 working_dir=working_dir, 264 weight_format=weight_format, 265 conda_env=conda_env, 266 devices=devices, 267 determinism=determinism, 268 expected_type=expected_type, 269 sha256=sha256, 270 stop_early=stop_early, 271 run_command=run_command, 272 **deprecated, 273 )
alias of test_description
204def validate_format( 205 data: BioimageioYamlContent, 206 /, 207 *, 208 format_version: Union[Literal["discover", "latest"], str] = DISCOVER, 209 context: Optional[ValidationContext] = None, 210) -> ValidationSummary: 211 """Validate a dictionary holding a bioimageio description. 212 See `bioimagieo.spec.load_description_and_validate_format_only` 213 to validate a file source. 214 215 Args: 216 data: Dictionary holding the raw bioimageio.yaml content. 217 format_version: Format version to (update to and) use for validation. 218 context: Validation context, see `bioimagieo.spec.ValidationContext` 219 220 Note: 221 Use `bioimagieo.spec.load_description_and_validate_format_only` to validate a 222 file source instead of loading the YAML content and creating the appropriate 223 `ValidationContext`. 224 225 Alternatively you can use `bioimagieo.spec.load_description` and access the 226 `validation_summary` attribute of the returned object. 227 """ 228 with context or get_validation_context(): 229 rd = build_description(data, format_version=format_version) 230 231 assert rd.validation_summary is not None 232 return rd.validation_summary
Validate a dictionary holding a bioimageio description.
See bioimagieo.spec.load_description_and_validate_format_only
to validate a file source.
Arguments:
- data: Dictionary holding the raw bioimageio.yaml content.
- format_version: Format version to (update to and) use for validation.
- context: Validation context, see
bioimagieo.spec.ValidationContext
Note:
Use
bioimagieo.spec.load_description_and_validate_format_only
to validate a file source instead of loading the YAML content and creating the appropriateValidationContext
.Alternatively you can use
bioimagieo.spec.load_description
and access thevalidation_summary
attribute of the returned object.
240class ValidationSummary(BaseModel, extra="allow"): 241 """Summarizes output of all bioimageio validations and tests 242 for one specific `ResourceDescr` instance.""" 243 244 name: str 245 """name of the validation""" 246 source_name: str 247 """source of the validated bioimageio description""" 248 id: Optional[str] = None 249 """ID of the resource being validated""" 250 type: str 251 """type of the resource being validated""" 252 format_version: str 253 """format version of the resource being validated""" 254 status: Literal["passed", "valid-format", "failed"] 255 """overall status of the bioimageio validation""" 256 details: List[ValidationDetail] 257 """list of validation details""" 258 env: Set[InstalledPackage] = Field( 259 default_factory=lambda: { 260 InstalledPackage(name="bioimageio.spec", version=VERSION) 261 } 262 ) 263 """list of selected, relevant package versions""" 264 265 saved_conda_list: Optional[str] = None 266 267 @field_serializer("saved_conda_list") 268 def _save_conda_list(self, value: Optional[str]): 269 return self.conda_list 270 271 @property 272 def conda_list(self): 273 if self.saved_conda_list is None: 274 p = subprocess.run( 275 ["conda", "list"], 276 stdout=subprocess.PIPE, 277 stderr=subprocess.STDOUT, 278 shell=True, 279 text=True, 280 ) 281 self.saved_conda_list = ( 282 p.stdout or f"`conda list` exited with {p.returncode}" 283 ) 284 285 return self.saved_conda_list 286 287 @property 288 def status_icon(self): 289 if self.status == "passed": 290 return "✔️" 291 elif self.status == "valid-format": 292 return "🟡" 293 else: 294 return "❌" 295 296 @property 297 def errors(self) -> List[ErrorEntry]: 298 return list(chain.from_iterable(d.errors for d in self.details)) 299 300 @property 301 def warnings(self) -> List[WarningEntry]: 302 return list(chain.from_iterable(d.warnings for d in self.details)) 303 304 def format( 305 self, 306 *, 307 width: Optional[int] = None, 308 include_conda_list: bool = False, 309 ): 310 """Format summary as Markdown string""" 311 return self._format( 312 width=width, target="md", include_conda_list=include_conda_list 313 ) 314 315 format_md = format 316 317 def format_html( 318 self, 319 *, 320 width: Optional[int] = None, 321 include_conda_list: bool = False, 322 ): 323 md_with_html = self._format( 324 target="html", width=width, include_conda_list=include_conda_list 325 ) 326 return markdown.markdown( 327 md_with_html, extensions=["tables", "fenced_code", "nl2br"] 328 ) 329 330 # TODO: fix bug which casuses extensive white space between the info table and details table 331 # (the generated markdown seems fine) 332 @no_type_check 333 def display( 334 self, 335 *, 336 width: Optional[int] = None, 337 include_conda_list: bool = False, 338 tab_size: int = 4, 339 soft_wrap: bool = True, 340 ) -> None: 341 try: # render as HTML in Jupyter notebook 342 from IPython.core.getipython import get_ipython 343 from IPython.display import display_html 344 except ImportError: 345 pass 346 else: 347 if get_ipython() is not None: 348 _ = display_html( 349 self.format_html( 350 width=width, include_conda_list=include_conda_list 351 ), 352 raw=True, 353 ) 354 return 355 356 # render with rich 357 self._format( 358 target=rich.console.Console( 359 width=width, 360 tab_size=tab_size, 361 soft_wrap=soft_wrap, 362 ), 363 width=width, 364 include_conda_list=include_conda_list, 365 ) 366 367 def add_detail(self, detail: ValidationDetail): 368 if detail.status == "failed": 369 self.status = "failed" 370 elif detail.status != "passed": 371 assert_never(detail.status) 372 373 self.details.append(detail) 374 375 def log( 376 self, 377 to: Union[Literal["display"], Path, Sequence[Union[Literal["display"], Path]]], 378 ) -> List[Path]: 379 """Convenience method to display the validation summary in the terminal and/or 380 save it to disk. See `save` for details.""" 381 if to == "display": 382 display = True 383 save_to = [] 384 elif isinstance(to, Path): 385 display = False 386 save_to = [to] 387 else: 388 display = "display" in to 389 save_to = [p for p in to if p != "display"] 390 391 if display: 392 self.display() 393 394 return self.save(save_to) 395 396 def save( 397 self, path: Union[Path, Sequence[Path]] = Path("{id}_summary_{now}") 398 ) -> List[Path]: 399 """Save the validation/test summary in JSON, Markdown or HTML format. 400 401 Returns: 402 List of file paths the summary was saved to. 403 404 Notes: 405 - Format is chosen based on the suffix: `.json`, `.md`, `.html`. 406 - If **path** has no suffix it is assumed to be a direcotry to which a 407 `summary.json`, `summary.md` and `summary.html` are saved to. 408 """ 409 if isinstance(path, (str, Path)): 410 path = [Path(path)] 411 412 # folder to file paths 413 file_paths: List[Path] = [] 414 for p in path: 415 if p.suffix: 416 file_paths.append(p) 417 else: 418 file_paths.extend( 419 [ 420 p / "summary.json", 421 p / "summary.md", 422 p / "summary.html", 423 ] 424 ) 425 426 now = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ") 427 for p in file_paths: 428 p = Path(str(p).format(id=self.id or "bioimageio", now=now)) 429 if p.suffix == ".json": 430 self.save_json(p) 431 elif p.suffix == ".md": 432 self.save_markdown(p) 433 elif p.suffix == ".html": 434 self.save_html(p) 435 else: 436 raise ValueError(f"Unknown summary path suffix '{p.suffix}'") 437 438 return file_paths 439 440 def save_json( 441 self, path: Path = Path("summary.json"), *, indent: Optional[int] = 2 442 ): 443 """Save validation/test summary as JSON file.""" 444 json_str = self.model_dump_json(indent=indent) 445 path.parent.mkdir(exist_ok=True, parents=True) 446 _ = path.write_text(json_str, encoding="utf-8") 447 logger.info("Saved summary to {}", path.absolute()) 448 449 def save_markdown(self, path: Path = Path("summary.md")): 450 """Save rendered validation/test summary as Markdown file.""" 451 formatted = self.format_md() 452 path.parent.mkdir(exist_ok=True, parents=True) 453 _ = path.write_text(formatted, encoding="utf-8") 454 logger.info("Saved Markdown formatted summary to {}", path.absolute()) 455 456 def save_html(self, path: Path = Path("summary.html")) -> None: 457 """Save rendered validation/test summary as HTML file.""" 458 path.parent.mkdir(exist_ok=True, parents=True) 459 460 html = self.format_html() 461 _ = path.write_text(html, encoding="utf-8") 462 logger.info("Saved HTML formatted summary to {}", path.absolute()) 463 464 @classmethod 465 def load_json(cls, path: Path) -> Self: 466 """Load validation/test summary from a suitable JSON file""" 467 json_str = Path(path).read_text(encoding="utf-8") 468 return cls.model_validate_json(json_str) 469 470 @field_validator("env", mode="before") 471 def _convert_dict(cls, value: List[Union[List[str], Dict[str, str]]]): 472 """convert old env value for backwards compatibility""" 473 if isinstance(value, list): 474 return [ 475 ( 476 (v["name"], v["version"], v.get("build", ""), v.get("channel", "")) 477 if isinstance(v, dict) and "name" in v and "version" in v 478 else v 479 ) 480 for v in value 481 ] 482 else: 483 return value 484 485 def _format( 486 self, 487 *, 488 target: Union[rich.console.Console, Literal["html", "md"]], 489 width: Optional[int], 490 include_conda_list: bool, 491 ): 492 return _format_summary( 493 self, 494 target=target, 495 width=width or 100, 496 include_conda_list=include_conda_list, 497 )
Summarizes output of all bioimageio validations and tests
for one specific ResourceDescr
instance.
271 @property 272 def conda_list(self): 273 if self.saved_conda_list is None: 274 p = subprocess.run( 275 ["conda", "list"], 276 stdout=subprocess.PIPE, 277 stderr=subprocess.STDOUT, 278 shell=True, 279 text=True, 280 ) 281 self.saved_conda_list = ( 282 p.stdout or f"`conda list` exited with {p.returncode}" 283 ) 284 285 return self.saved_conda_list
304 def format( 305 self, 306 *, 307 width: Optional[int] = None, 308 include_conda_list: bool = False, 309 ): 310 """Format summary as Markdown string""" 311 return self._format( 312 width=width, target="md", include_conda_list=include_conda_list 313 )
Format summary as Markdown string
304 def format( 305 self, 306 *, 307 width: Optional[int] = None, 308 include_conda_list: bool = False, 309 ): 310 """Format summary as Markdown string""" 311 return self._format( 312 width=width, target="md", include_conda_list=include_conda_list 313 )
Format summary as Markdown string
317 def format_html( 318 self, 319 *, 320 width: Optional[int] = None, 321 include_conda_list: bool = False, 322 ): 323 md_with_html = self._format( 324 target="html", width=width, include_conda_list=include_conda_list 325 ) 326 return markdown.markdown( 327 md_with_html, extensions=["tables", "fenced_code", "nl2br"] 328 )
332 @no_type_check 333 def display( 334 self, 335 *, 336 width: Optional[int] = None, 337 include_conda_list: bool = False, 338 tab_size: int = 4, 339 soft_wrap: bool = True, 340 ) -> None: 341 try: # render as HTML in Jupyter notebook 342 from IPython.core.getipython import get_ipython 343 from IPython.display import display_html 344 except ImportError: 345 pass 346 else: 347 if get_ipython() is not None: 348 _ = display_html( 349 self.format_html( 350 width=width, include_conda_list=include_conda_list 351 ), 352 raw=True, 353 ) 354 return 355 356 # render with rich 357 self._format( 358 target=rich.console.Console( 359 width=width, 360 tab_size=tab_size, 361 soft_wrap=soft_wrap, 362 ), 363 width=width, 364 include_conda_list=include_conda_list, 365 )
375 def log( 376 self, 377 to: Union[Literal["display"], Path, Sequence[Union[Literal["display"], Path]]], 378 ) -> List[Path]: 379 """Convenience method to display the validation summary in the terminal and/or 380 save it to disk. See `save` for details.""" 381 if to == "display": 382 display = True 383 save_to = [] 384 elif isinstance(to, Path): 385 display = False 386 save_to = [to] 387 else: 388 display = "display" in to 389 save_to = [p for p in to if p != "display"] 390 391 if display: 392 self.display() 393 394 return self.save(save_to)
Convenience method to display the validation summary in the terminal and/or
save it to disk. See save
for details.
396 def save( 397 self, path: Union[Path, Sequence[Path]] = Path("{id}_summary_{now}") 398 ) -> List[Path]: 399 """Save the validation/test summary in JSON, Markdown or HTML format. 400 401 Returns: 402 List of file paths the summary was saved to. 403 404 Notes: 405 - Format is chosen based on the suffix: `.json`, `.md`, `.html`. 406 - If **path** has no suffix it is assumed to be a direcotry to which a 407 `summary.json`, `summary.md` and `summary.html` are saved to. 408 """ 409 if isinstance(path, (str, Path)): 410 path = [Path(path)] 411 412 # folder to file paths 413 file_paths: List[Path] = [] 414 for p in path: 415 if p.suffix: 416 file_paths.append(p) 417 else: 418 file_paths.extend( 419 [ 420 p / "summary.json", 421 p / "summary.md", 422 p / "summary.html", 423 ] 424 ) 425 426 now = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ") 427 for p in file_paths: 428 p = Path(str(p).format(id=self.id or "bioimageio", now=now)) 429 if p.suffix == ".json": 430 self.save_json(p) 431 elif p.suffix == ".md": 432 self.save_markdown(p) 433 elif p.suffix == ".html": 434 self.save_html(p) 435 else: 436 raise ValueError(f"Unknown summary path suffix '{p.suffix}'") 437 438 return file_paths
Save the validation/test summary in JSON, Markdown or HTML format.
Returns:
List of file paths the summary was saved to.
Notes:
- Format is chosen based on the suffix:
.json
,.md
,.html
. - If path has no suffix it is assumed to be a direcotry to which a
summary.json
,summary.md
andsummary.html
are saved to.
440 def save_json( 441 self, path: Path = Path("summary.json"), *, indent: Optional[int] = 2 442 ): 443 """Save validation/test summary as JSON file.""" 444 json_str = self.model_dump_json(indent=indent) 445 path.parent.mkdir(exist_ok=True, parents=True) 446 _ = path.write_text(json_str, encoding="utf-8") 447 logger.info("Saved summary to {}", path.absolute())
Save validation/test summary as JSON file.
449 def save_markdown(self, path: Path = Path("summary.md")): 450 """Save rendered validation/test summary as Markdown file.""" 451 formatted = self.format_md() 452 path.parent.mkdir(exist_ok=True, parents=True) 453 _ = path.write_text(formatted, encoding="utf-8") 454 logger.info("Saved Markdown formatted summary to {}", path.absolute())
Save rendered validation/test summary as Markdown file.
456 def save_html(self, path: Path = Path("summary.html")) -> None: 457 """Save rendered validation/test summary as HTML file.""" 458 path.parent.mkdir(exist_ok=True, parents=True) 459 460 html = self.format_html() 461 _ = path.write_text(html, encoding="utf-8") 462 logger.info("Saved HTML formatted summary to {}", path.absolute())
Save rendered validation/test summary as HTML file.
464 @classmethod 465 def load_json(cls, path: Path) -> Self: 466 """Load validation/test summary from a suitable JSON file""" 467 json_str = Path(path).read_text(encoding="utf-8") 468 return cls.model_validate_json(json_str)
Load validation/test summary from a suitable JSON file