evaluatePython

A Lambda Feedback evaluation function that executes student Python code submissions in a secure sandbox, runs them against test cases, and returns structured formative feedback. Deployed as a Docker container on the Lambda Feedback platform.

Deployment

Push to main triggers GitHub Actions which automatically builds and deploys to Lambda Feedback. See .github/workflows/ for CI/CD configuration.

Usage

Run the Docker Image

docker run -it --rm -p 8080:8080 ghcr.io/lambda-feedback/evaluatepython:latest

The image includes Shimmy, which listens for HTTP requests on port 8080 and forwards them to the evaluation function.

Evaluation Modes

The function supports three modes, set via params.mode.

demo — run student code and show output (no pass/fail):

{
  "response": "print(5 * 5)",
  "params": { "mode": "demo" }
}

io_test — compare stdout against expected output for each test case:

{
  "response": "n = int(input())\nprint(n * n)",
  "params": {
    "mode": "io_test",
    "tests": [
      { "input": "5\n", "expected_output": "25\n" },
      { "input": "3\n", "expected_output": "9\n", "hidden": true }
    ]
  }
}

unit_test — run student code then execute test_* functions or unittest.TestCase subclasses (including Hypothesis tests):

{
  "response": "def square(n): return n * n",
  "params": {
    "mode": "unit_test",
    "test_code": "def test_positive():\n    assert square(5) == 25\ndef test_zero():\n    assert square(0) == 0\n"
  }
}

Development

Prerequisites

Python 3.12+
Poetry
Docker (for container builds)

Repository Structure

evaluation_function/main.py             # IPC server entry point
evaluation_function/evaluation.py       # core evaluation pipeline (all three modes)
evaluation_function/preview.py          # AST-based security validator
evaluation_function/dev.py              # CLI wrapper for local testing
evaluation_function/evaluation_test.py  # integration tests
evaluation_function/preview_test.py     # preview/security tests
config.json                             # deployment configuration

Setup

poetry install

Local Testing

The dev.py script calls the evaluation function directly (no Docker required). It defaults to demo mode if no params are supplied:

# demo mode (default)
python -m evaluation_function.dev "print(5 * 5)"

# io_test mode
python -m evaluation_function.dev "print(int(input())**2)" "" \
  '{"mode":"io_test","tests":[{"input":"5\n","expected_output":"25\n"}]}'

# unit_test mode
python -m evaluation_function.dev "def square(n): return n*n" "" \
  '{"mode":"unit_test","test_code":"def test_sq():\n    assert square(3)==9\n"}'

Running Tests

pytest

Linting

# Critical errors (fail CI)
flake8 ./evaluation_function --count --select=E9,F63,F7,F82 --show-source --statistics
# Style/complexity (informational)
flake8 ./evaluation_function --count --exit-zero --max-complexity=10 --max-line-length=127 --statistics

Building the Docker Image

docker build -t evaluatepython .
# Cross-platform (CI uses linux/x86_64):
docker build --platform=linux/x86_64 -t evaluatepython .

Running the Docker Image

docker run -it --rm -p 8080:8080 evaluatepython

Deployment to Lambda Feedback

The function name is declared in config.json as "evaluatePython" (lowerCamelCase). Pushing to main triggers automated deployment via GitHub Actions.

Important

The evaluation function name must be unique within the Lambda Feedback organization and must be in lowerCamelCase.

Troubleshooting

Containerized Function Fails to Start

Run-time dependencies: ensure all packages are in pyproject.toml and installed via poetry install in the Dockerfile.
Architecture: some packages are platform-specific. Build with --platform=linux/x86_64 to match the CI/production environment.
Standalone check: run the function directly inside the container to isolate startup errors:

docker run -it --rm evaluatepython python -m evaluation_function.main

Pulling Changes from the Template Repository

git remote add template https://github.com/lambda-feedback/evaluation-function-boilerplate-python.git
git fetch --all
git merge template/main --allow-unrelated-histories

Warning

Resolve conflicts carefully — template updates may overwrite evaluatePython-specific code.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github		.github
docs		docs
evaluation_function		evaluation_function
postman		postman
.dockerignore		.dockerignore
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
README.md		README.md
config.json		config.json
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

evaluatePython

Deployment

Usage

Run the Docker Image

Evaluation Modes

Development

Prerequisites

Repository Structure

Setup

Local Testing

Running Tests

Linting

Building the Docker Image

Running the Docker Image

Deployment to Lambda Feedback

Troubleshooting

Containerized Function Fails to Start

Pulling Changes from the Template Repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

evaluatePython

Deployment

Usage

Run the Docker Image

Evaluation Modes

Development

Prerequisites

Repository Structure

Setup

Local Testing

Running Tests

Linting

Building the Docker Image

Running the Docker Image

Deployment to Lambda Feedback

Troubleshooting

Containerized Function Fails to Start

Pulling Changes from the Template Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages