Installation

Installation

PythonRustNode

🌍 Tokenizers is tested on Python 3.5+.

You should install 🌍 Tokenizers in a virtual environmentarrow-up-right. If you’re unfamiliar with Python virtual environments, check out the user guidearrow-up-right. Create a virtual environment with the version of Python you’re going to use and activate it.

Installation with pip

🌍 Tokenizers can be installed using pip as follows:

Copied

pip install tokenizers

Installation from sources

To use this method, you need to have the Rust language installed. You can follow the official guidearrow-up-right for more information.

If you are using a unix based OS, the installation should be as simple as running:

Copied

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

Or you can easiy update it with the following command:

Copied

Once rust is installed, we can start retrieving the sources for 🌍 Tokenizers:

Copied

Then we go into the python bindings folder:

Copied

At this point you should have your virtual environmentarrow-up-right already activated. In order to compile 🌍 Tokenizers, you need to install the Python package setuptools_rust:

Copied

Then you can have 🌍 Tokenizers compiled and installed in your virtual environment with the following command:

Copied

Last updated