Jdaviz Profiler

Jdaviz Profiler is a Python toolkit designed to automate the generation and profiling of Jupyter notebooks for the jdaviz visualization suite. It enables users to systematically test and benchmark jdaviz’s Imviz plugin under a variety of parameter combinations, such as image size, number of images, viewport size, and more.

Features

Notebook Generation:

Automatically creates Jupyter notebooks from a template (template.ipynb) and a parameter configuration file (params.json).

All possible combinations of parameters are generated, allowing for comprehensive profiling.

The template.ipynb file serves as the base notebook, while params.json contains the parameter values to be injected into the notebook.

The template.ipynb must have a cell with placeholders for the parameters to be replaced, therefore this cell must:

precede all other cells with actual code using the parameters.
be tagged with the parameters label.

Each parameter in the params.json file must have a corresponding placeholder in the template.ipynb file, and the placeholders must be unique having _value as suffix, e.g. image_pixel_side_value or viewport_pixel_size_value correspond to image_pixel_side or viewport_pixel_size parameter value used in the template.ipynb.

The generated parameterized notebooks will be saved in the <usecase path>/notebooks directory.

An example of how to structure a new <usecase> (along with template.ipynb and params.json files) is provided in this repository in imviz_images.

Notebook Profiling:

Uses Selenium to launch and interact with JupyterLab, executing each notebook cell and recording performance metrics. Optionally, if a cell is tagged with:

skip_profiling, the performance metrics during the execution of that cell will not be collected.
wait_for_viz, the profiler will wait after cell execution for Imviz to be stable (i.e. all images loaded and rendered) before proceeding to the next cell. This is useful for cells that load images into Imviz, ensuring accurate profiling of rendering times.

Session Management:

Handles JupyterLab sessions, kernel restarts, notebook uploads, and clean-up automatically.

Extensible:

Easily add new parameters or modify the template to test different scenarios, as well as create new <usecases> following the directives under "Notebook Generation".

How It Works

Parameter Setup: Define the parameters and their possible values in params.json.
Notebook Generation: Run the notebook generator to create all combinations of notebooks in the output directory.
Profiling: Use the profiler to execute each notebook cell in a JupyterLab instance, collecting timing and output data for each cell.
Results: The profiling results can be saved in a structured format file (CSV) for analysis.

Installation

To install, check out this repository and run:

pip install -e .

Python 3.12 or later is supported.

Pre-commit hook

To install the pre-commit hook, simply run:

pip install ruff mypy types-requests types-psutil types-tqdm pre-commit
pre-commit install

Usage

Generate all possible notebooks from a usecase:

./notebooks_generator.py --input_dir_path <usecase path>

Profile a specific notebook:
```
./notebook_profiler.py --url <JupyterLab URL> --token <API Token> --kernel_name <kernel name> --nb_input_path <notebook path>
```
Additional arguments:
- --headless: Run the browser in headless mode (default: False, same as --no-headless).
- --max_wait_time: Max time to wait after executing each cell (in seconds, default: 300).
- --screenshots_dir_path: Path to the directory to where screenshots will be stored (default: None, no screenshot will be saved).
- --notebook_metrics_file_path: Path to the file to where the notebook metrics will be stored. (default: None, no notebook metrics will be stored).
- --cell_metrics_file_path: Path to the file to where the cell metrics will be stored. (default: None, no cell metrics will be stored).
Generate all possible notebooks from a usecase and profile all of them:
```
./generate_and_profile.py --input_dir_path <usecase path> --url <JupyterLab URL> --token <API Token> --kernel_name <kernel name>
```
Additional arguments:
- --headless: Run the browser in headless mode (default: False, same as --no-headless).
- --max_wait_time: Max time to wait after executing each cell (in seconds, default: 300).
- --log_screenshots: Whether to log screenshots or not (default: False, same as --no-log_screenshots).
- --save_metrics: Whether to save profiling metrics to a CSV file (default: False, same as --no-save_metrics).

All scripts expose a --log_level argument to set the logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL; default is INFO), and a --log_file argument to specify a log file path (if not provided, logs will only be printed to the console).

All scripts have a --help option for more details on usage and available arguments.

Dependencies

jdaviz
pillow
selenium
chromedriver-py
requests
nbformat
tqdm

License

BSD 3-Clause License

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
imviz_images		imviz_images
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
generate_and_profile.py		generate_and_profile.py
notebook_profiler.py		notebook_profiler.py
notebooks_generator.py		notebooks_generator.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Jdaviz Profiler

Features

Notebook Generation:

Notebook Profiling:

Session Management:

Extensible:

How It Works

Installation

Pre-commit hook

Usage

Dependencies

License

About

Uh oh!

Releases

Packages

Languages

License

bafio/jdaviz_profiler

Folders and files

Latest commit

History

Repository files navigation

Jdaviz Profiler

Features

Notebook Generation:

Notebook Profiling:

Session Management:

Extensible:

How It Works

Installation

Pre-commit hook

Usage

Dependencies

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages