Parallel Computing: Single Machine

Session Overview

  • Tabular Data
  • Array Data

Tabular Data

  1. Make sure your Module 2 Large Coder Instance is running.
  2. Open up JupyterLab
  3. Navigate to the Module 2 Practice directory.
  4. Stash your changes or commit them to a new branch.
  5. Pull the latest changes from the main branch.

6. In a terminal run:

sudo apt update
sudo apt install npm
sudo npm install -g gtop
gtop


What do you see?

Configuring AWS Credentials

What do we do?

Tabular Data:

If you do not have the UV Env and Kernel set up:

  1. Move into the Module 2 Practice directory.
  2. Use git pull to pull the latest changes to the repository from GitLab.
  3. Run uv sync.
  4. Update your AWS credentials.
  5. Activate your virtual environment.
  6. Run:
python -m ipykernel install --user --name=coder --display-name="Python (UV Environment)"

Make sure you are running your notebook with the UV Kernel.

Tabular Data: Follow Along

  • Open tabular.ipynb
  • Run the cells in order
  • Monitor the CPU usage using gtop

Array Data

  • In a terminal run sh run-jupyter.sh
  • Go to Coder and open the relevant port
  • After launching Dask, open Dask port as well.