Virtual Environments¶
Info
The guide below is only meant for reference only and not meant to be followed verbatim. You may need to generate your own guide site if you require guidance specifically for your own project.
While the Docker images you will be using to run experiments on Run:ai would contain the conda
environments you would need, you can also create these virtual environments within your development environment, and have it be persisted. The following set of commands allows you to create the conda
environment and store the packages within your own workspace directory:
-
First, have VSCode open the repository that you have cloned previously by heading over to the top left hand corner, selecting
File > Open Folder...
, and entering the path to the repository. In this case, you should be navigating to the folder/<NAME_OF_DATA_SOURCE>/workspaces/<YOUR_HYPHENATED_NAME>/project
. -
Now, let's initialise
conda
for the bash shell, and create the virtual environment specified inproject-conda-env.yaml
.
# Usually this is fine
conda env create -f project-conda-env.yaml
# However, if your configuration doesn't point towards the PVC source, you can use this instead
# Consult the MLOps team if you're unsure.
conda env create \
-f project-conda-env.yaml \
-p /<NAME_OF_DATA_SOURCE>/workspaces/<YOUR_HYPHENATED_NAME>/conda_envs/project
If you're using the 2nd conda env create
option
After creating the conda
environment, you can create a permanent alias for easy activation.
echo 'alias project-conda="conda activate /<NAME_OF_DATA_SOURCE>/workspaces/<YOUR_HYPHENATED_NAME>/conda_envs/project"' >> ~/.bashrc
source ~/.bashrc
project-conda
# conda environment has been activated as project
Tip
If you encounter issues in trying to install Python libraries, do ensure that the amount of resources allocated to the VSCode server is sufficient. Installation of libraries from PyPI tends to fail when there's insufficient memory. For starters, dedicate 4GB of memory to the service:
Another way is to add the flag --no-cache-dir
for your pip install
executions. However, there's no similar flag for conda
at the moment so the above is a blanket solution.
The Conda YAML configuration file uses the requirements.txt
to create a Conda environment. This is so that there is parity between the development and deployment environment as the Docker image would use requirements.txt
as the list of packages to be installed.
You may have to do extra pip installations, depending on your project's requirements. You can install them through pip install -r <file>
.
Reference Link(s)
Jupyter Kernel for VSCode¶
While it is possible for VSCode to make use of different virtual Python environments, some other additional steps are required for the VSCode server to detect the conda
environments that you would have created.
-
Ensure that you are in a project folder which you intend to work on. You can open a folder through
File > Open Folder...
. In this case, you should be navigating to the folder/<NAME_OF_DATA_SOURCE>/workspaces/<YOUR_HYPHENATED_NAME>/project
. -
Install the VSCode extensions
ms-python.python
andms-toolsai.jupyter
. After installation of these extensions, restart VSCode by using the shortcutCtrl + Shift + P
, enteringDeveloper: Reload Window
in the prompt and pressingEnter
following that.
Manual Installation
For some clusters, you may need to install the extensions manually due to firewall issues. If that is the case, you can download the extension(s) through your local machine and upload them to the VSCode terminal. From there, you can make use of the following command:
$ code-server --install-extension /path/to/extension.vsix
- Ensure that you have
ipykernel
installed in theconda
environment that you intend to use. This template by default lists the library as a dependency underrequirements.txt
. You can check for the library like so:
# Usually this is fine
conda activate project
# If you're using the 2nd option
conda activate /<NAME_OF_DATA_SOURCE>/workspaces/<YOUR_HYPHENATED_NAME>/conda_envs/project
conda list | grep "ipykernel"
Output should look similar to:
ipykernel X.XX.XX pypi_0 pypi
-
Now enter
Ctrl + Shift + P
again and executePython: Select Interpreter
. Provide the path to the Python executable within theconda
environment that you intend to use, something like so:path/to/conda_env/bin/python
. -
Open up any Jupyter notebook and click on the button that says
Select Kernel
on the top right hand corner. You will be presented with a selection of Python interpreters. Select the one that corresponds to the environment you intend to use. -
Test out the kernel by running the cells in the sample notebook provided under
notebooks/sample-pytorch-notebook.ipynb
.