We prefer to manage and resolve dependencies through poetry because it is more sophisticated and powerful, but we also provide a requirements.txt for anyone who prefers not to use poetry.
To install poetry locally, run:
curl -sSL https://install.python-poetry.org | python -
To convert from poetry to requirements.txt, simply run:
poetry export -f requirements.txt --without-hashes --output requirements.txt
# the following sed command strips the `python_full_version` and `python_version` markers from each line:
sed -i '' 's/; .*$//g' requirements.txt
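If sed is unavailable, or the BSD-style `-i ''` flag doesn't match your platform, the same cleanup can be sketched in Python. The sample requirement line in the comments is purely illustrative, not taken from this project's lockfile:
# Rough Python equivalent of the sed one-liner above: strip the environment
# markers that poetry export appends after "; " on each line, e.g. a line like
#   pandas==2.2.2 ; python_version >= "3.9" and python_version < "3.13"
# becomes
#   pandas==2.2.2
import re
from pathlib import Path

path = Path('requirements.txt')
path.write_text(re.sub(r'; .*$', '', path.read_text(), flags=re.MULTILINE))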
You can use `make` to build the Docker containers on any of the supported Python versions:
# to build on python 3.11
make build_3_11
# if you need to force the cache to refresh
make build_3_11 DOCKER_BUILDKIT=0
You can run a local pickled model via:
docker run -i --rm -v "$PWD:$PWD" ghcr.io/numerai/numerai_predict_py_3_11:stable --debug --model $PWD/model.pkl
# optionally, you can run with --platform linux/amd64 or --platform linux/arm64 depending on host architecture
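For a local `--debug` run you first need a model.pkl on disk. Below is a minimal sketch of producing one; it assumes the pickle holds a callable that maps a live-features DataFrame to predictions, so verify the expected signature against the official examples before relying on this shape:
# Hypothetical model.pkl for smoke-testing the container locally.
# Assumption: the container unpickles a callable and passes it live features;
# check the repo's examples for the exact expected signature.
import cloudpickle
import pandas as pd

def predict(live_features: pd.DataFrame) -> pd.DataFrame:
    # Constant predictions, purely to exercise the pipeline end to end.
    return pd.DataFrame({'prediction': 0.5}, index=live_features.index)

with open('model.pkl', 'wb') as f:
    cloudpickle.dump(predict, f)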
Presigned GET and POST URLs are used to ensure that only the specified model is downloaded during execution and that prediction uploads from other models cannot be accessed or tampered with.
The `--model` arg is designed to accept a pre-signed S3 GET URL generated via boto3:
import boto3

s3_client = boto3.client('s3')
params = dict(Bucket='numerai-pickled-user-models',
              Key='5a5a8da7-05a4-41bf-9c2b-7f61bab5b89b/model-Kc5pT9r85SRD.pkl')
presigned_model_url = s3_client.generate_presigned_url("get_object", Params=params, ExpiresIn=600)
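The resulting URL can be fetched by any plain HTTP client while it is valid, which is roughly what the container does with the `--model` argument. A sketch using requests, reusing `presigned_model_url` from the snippet above:
# Sketch: download the pickled model through the pre-signed GET URL.
# No AWS credentials are needed on the consuming side; the signature embedded
# in the URL scopes access to this one object until ExpiresIn elapses.
import requests

response = requests.get(presigned_model_url, timeout=30)
response.raise_for_status()
with open('model.pkl', 'wb') as f:
    f.write(response.content)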
The `--post_url` and `--post_data` args are designed to accept a pre-signed S3 POST URL and a urlencoded data dictionary, both generated via boto3:
import urllib.parse
import boto3

s3_client = boto3.client('s3')
presigned_post = s3_client.generate_presigned_post(Bucket='numerai-pickled-user-models-live-output',
                                                   Key='5a5a8da7-05a4-41bf-9c2b-7f61bab5b89b/live_predictions-b7446fc4cc7e.csv',
                                                   ExpiresIn=600)
post_url = presigned_post['url']
post_data = urllib.parse.urlencode(presigned_post['fields'])
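On the consuming side, a pre-signed POST is used by sending the signed fields back alongside the file in a multipart form. The sketch below shows the standard pattern (the same fields that `--post_data` carries in urlencoded form); the filename is hypothetical and the container's actual upload code may differ:
# Sketch: upload a predictions file using the pre-signed POST from above.
# The signed fields must accompany the file, and the upload can only land at
# the exact bucket/key that was signed.
import requests

with open('live_predictions.csv', 'rb') as f:
    response = requests.post(post_url,
                             data=presigned_post['fields'],
                             files={'file': f})
response.raise_for_status()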