This file provides guidance to LLM Agents such as Codex, Gemini, Claude Code (claude.ai/code), etc. when working with code in this repository.
- ALL tests MUST pass for code to be considered complete and working
- Never describe code as "working as expected" if there are ANY failing tests
- Even if specific feature tests pass, failing tests elsewhere indicate broken functionality
- Changes that break existing tests must be fixed before considering implementation complete
- A successful implementation must pass linting, type checking, AND all existing tests
libvcs is a lite, typed Python tool for:
- Detecting and parsing URLs for Git, Mercurial, and Subversion repositories
- Providing command abstractions for git, hg, and svn
- Synchronizing repositories locally
- Creating pytest fixtures for testing with temporary repositories
The library powers vcspull, a tool for managing and synchronizing multiple git, svn, and mercurial repositories.
This project uses:
- Python 3.9+
- uv for dependency management
- ruff for linting and formatting
- mypy for type checking
- pytest for testing
# Install dependencies
uv pip install --editable .
uv pip sync
# Install with development dependencies
uv pip install --editable . -G dev# Run all tests
just test
# or directly with pytest
uv run pytest
# Run a single test file
uv run pytest tests/sync/test_git.py
# Run a specific test
uv run pytest tests/sync/test_git.py::test_remotes
# Run tests with test watcher
just start
# or
uv run ptw .# Run ruff for linting
just ruff
# or directly
uv run ruff check .
# Format code with ruff
just ruff-format
# or directly
uv run ruff format .
# Run ruff linting with auto-fixes
uv run ruff check . --fix --show-fixes
# Run mypy for type checking
just mypy
# or directly
uv run mypy src tests
# Watch mode for linting (using entr)
just watch-ruff
just watch-mypyFollow this workflow for code changes:
- Format First:
uv run ruff format . - Run Tests:
uv run pytest - Run Linting:
uv run ruff check . --fix --show-fixes - Check Types:
uv run mypy - Verify Tests Again:
uv run pytest
# Build documentation
just build-docs
# Start documentation server with auto-reload
just start-docs
# Update documentation CSS/JS
just design-docslibvcs is organized into three main modules:
-
URL Detection and Parsing (
libvcs.url)- Base URL classes in
url/base.py - VCS-specific implementations in
url/git.py,url/hg.py, andurl/svn.py - URL registry in
url/registry.py - Constants in
url/constants.py
- Base URL classes in
-
Command Abstraction (
libvcs.cmd)- Command classes for git, hg, and svn in
cmd/git.py,cmd/hg.py, andcmd/svn.py - Built on top of Python's subprocess module (via
_internal/subprocess.py)
- Command classes for git, hg, and svn in
-
Repository Synchronization (
libvcs.sync)- Base sync classes in
sync/base.py - VCS-specific sync implementations in
sync/git.py,sync/hg.py, andsync/svn.py
- Base sync classes in
-
Internal Utilities (
libvcs._internal)- Subprocess wrappers in
_internal/subprocess.py - Data structures in
_internal/dataclasses.pyand_internal/query_list.py - Runtime helpers in
_internal/run.pyand_internal/shortcuts.py
- Subprocess wrappers in
-
pytest Plugin (
libvcs.pytest_plugin)- Provides fixtures for creating temporary repositories for testing
libvcs uses pytest for testing with many custom fixtures. The pytest plugin (pytest_plugin.py) defines fixtures for creating temporary repositories for testing. These include:
create_git_remote_repo: Creates a git repository for testingcreate_hg_remote_repo: Creates a Mercurial repository for testingcreate_svn_remote_repo: Creates a Subversion repository for testinggit_repo,svn_repo,hg_repo: Pre-made repository instancesset_home,gitconfig,hgconfig,git_commit_envvars: Environment fixtures
These fixtures handle setup and teardown automatically, creating isolated test environments.
For running tests with actual VCS commands, tests will be skipped if the corresponding VCS binary is not installed.
- Use functional tests only: Write tests as standalone functions (
test_*), not classes. Avoidclass TestFoo:groupings - use descriptive function names and file organization instead. This applies to pytest tests, not doctests.
def test_repo_sync(git_repo):
# git_repo is already a GitSync instance with a clean repository
# Use it directly in your tests
assert git_repo.get_revision() == "initial"Use typing.NamedTuple for parameterized tests:
class RepoFixture(t.NamedTuple):
test_id: str # For test naming
repo_args: dict[str, t.Any]
expected_result: str
@pytest.mark.parametrize(
list(RepoFixture._fields),
REPO_FIXTURES,
ids=[test.test_id for test in REPO_FIXTURES],
)
def test_sync(
# Parameters and fixtures...
):
# Test implementation- Use namespace imports for stdlib:
import enuminstead offrom enum import Enum; third-party packages may usefrom X import Y - For typing, use
import typing as tand access via namespace:t.NamedTuple, etc. - Use
from __future__ import annotationsat the top of all Python files
Follow NumPy docstring style for all functions and methods:
"""Short description of the function or class.
Detailed description using reStructuredText format.
Parameters
----------
param1 : type
Description of param1
param2 : type
Description of param2
Returns
-------
type
Description of return value
"""All functions and methods MUST have working doctests. Doctests serve as both documentation and tests.
CRITICAL RULES:
- Doctests MUST actually execute - never comment out
asyncio.run()or similar calls - Doctests MUST NOT be converted to
.. code-block::as a workaround (code-blocks don't run) - If you cannot create a working doctest, STOP and ask for help
Available tools for doctests:
doctest_namespacefixtures:tmp_path,asyncio,create_git_remote_repo,create_hg_remote_repo,create_svn_remote_repo,example_git_repo- Ellipsis for variable output:
# doctest: +ELLIPSIS - Update
pytest_plugin.pyto add new fixtures todoctest_namespace
# doctest: +SKIP is NOT permitted - it's just another workaround that doesn't test anything. If a VCS binary might not be installed, pytest already handles skipping via skip_if_binaries_missing. Use the fixtures properly.
Async doctest pattern:
>>> async def example():
... result = await some_async_function()
... return result
>>> asyncio.run(example())
'expected output'Using fixtures in doctests:
>>> git = Git(path=tmp_path) # tmp_path from doctest_namespace
>>> git.run(['status'])
'...'When output varies, use ellipsis:
>>> git.clone(url=f'file://{create_git_remote_repo()}') # doctest: +ELLIPSIS
'Cloning into ...'These rules guide future logging changes; existing code may not yet conform.
- Use
logging.getLogger(__name__)in every module - Add
NullHandlerin library__init__.pyfiles - Never configure handlers, levels, or formatters in library code — that's the application's job
Pass structured data on every log call where useful for filtering, searching, or test assertions.
Core keys (stable, scalar, safe at any log level):
| Key | Type | Context |
|---|---|---|
vcs_cmd |
str |
VCS command line |
vcs_type |
str |
VCS type (git, svn, hg) |
vcs_url |
str |
repository URL |
vcs_exit_code |
int |
VCS process exit code |
vcs_repo_path |
str |
local repository path |
Heavy/optional keys (DEBUG only, potentially large):
| Key | Type | Context |
|---|---|---|
vcs_stdout |
list[str] |
VCS stdout lines (truncate or cap; %(vcs_stdout)s produces repr) |
vcs_stderr |
list[str] |
VCS stderr lines (same caveats) |
Treat established keys as compatibility-sensitive — downstream users may build dashboards and alerts on them. Change deliberately.
snake_case, not dotted;vcs_prefix- Prefer stable scalars; avoid ad-hoc objects
- Heavy keys (
vcs_stdout,vcs_stderr) are DEBUG-only; consider companionvcs_stdout_lenfields or hard truncation (e.g.stdout[:100])
logger.debug("msg %s", val) not f-strings. Two rationales:
- Deferred string interpolation: skipped entirely when level is filtered
- Aggregator message template grouping:
"Running %s"is one signature grouped ×10,000; f-strings make each line unique
When computing val itself is expensive, guard with if logger.isEnabledFor(logging.DEBUG).
Increment for each wrapper layer so %(filename)s:%(lineno)d and OTel code.filepath point to the real caller. Verify whenever call depth changes.
For objects with stable identity (Repository, Remote, Sync), use LoggerAdapter to avoid repeating the same extra on every call. Lead with the portable pattern (override process() to merge); merge_extra=True simplifies this on Python 3.13+.
| Level | Use for | Examples |
|---|---|---|
DEBUG |
Internal mechanics, VCS I/O | VCS command + stdout, URL parsing steps |
INFO |
Repository lifecycle, user-visible operations | Repository cloned, sync completed |
WARNING |
Recoverable issues, deprecation, user-actionable config | Deprecated VCS option, unrecognized remote |
ERROR |
Failures that stop an operation | VCS command failed, invalid URL |
Config discovery noise belongs in DEBUG; only surprising/user-actionable config issues → WARNING.
- Lowercase, past tense for events:
"repository cloned","vcs command failed" - No trailing punctuation
- Keep messages short; put details in
extra, not the message string
- Use
logger.exception()only insideexceptblocks when you are not re-raising - Use
logger.error(..., exc_info=True)when you need the traceback outside anexceptblock - Avoid
logger.exception()followed byraise— this duplicates the traceback. Either add context viaextrathat would otherwise be lost, or let the exception propagate
Assert on caplog.records attributes, not string matching on caplog.text:
- Scope capture:
caplog.at_level(logging.DEBUG, logger="libvcs.cmd") - Filter records rather than index by position:
[r for r in caplog.records if hasattr(r, "vcs_cmd")] - Assert on schema:
record.vcs_exit_code == 0not"exit code 0" in caplog.text caplog.record_tuplescannot access extra fields — always usecaplog.records
- f-strings/
.format()in log calls - Unguarded logging in hot loops (guard with
isEnabledFor()) - Catch-log-reraise without adding new context
print()for diagnostics- Logging secret env var values (log key names only)
- Non-scalar ad-hoc objects in
extra - Requiring custom
extrafields in format strings without safe defaults (missing keys raiseKeyError)
Format commit messages as:
Scope(type[detail]): concise description
why: Explanation of necessity or impact.
what:
- Specific technical changes made
- Focused on a single topic
Common commit types:
- feat: New features or enhancements
- fix: Bug fixes
- refactor: Code restructuring without functional change
- docs: Documentation updates
- chore: Maintenance (dependencies, tooling, config)
- test: Test-related updates
- style: Code style and formatting
- ai(rules[AGENTS]): AI rule updates
- ai(claude[rules]): Claude Code rules (CLAUDE.md)
- ai(claude[command]): Claude Code command changes
Example:
url/git(feat[GitURL]): Add support for custom SSH port syntax
why: Enable parsing of Git URLs with custom SSH ports
what:
- Add port capture to SCP_REGEX pattern
- Update GitURL.to_url() to include port if specified
- Add tests for the new functionality
For multi-line commits, use heredoc to preserve formatting:
git commit -m "$(cat <<'EOF'
feat(Component[method]) add feature description
why: Explanation of the change.
what:
- First change
- Second change
EOF
)"When writing documentation (README, CHANGES, docs/), follow these rules for code blocks:
One command per code block. This makes commands individually copyable. For sequential commands, either use separate code blocks or chain them with && or ; and \ continuations (keeping it one logical command).
Put explanations outside the code block, not as comments inside.
Good:
Run the tests:
$ uv run pytestRun with coverage:
$ uv run pytest --covBad:
# Run the tests
$ uv run pytest
# Run with coverage
$ uv run pytest --covThese rules apply to shell commands in documentation (README, CHANGES, docs/), not to Python doctests.
Use console language tag with $ prefix. This distinguishes interactive commands from scripts and enables prompt-aware copy in many terminals.
Good:
$ uv run pytestBad:
uv run pytestSplit long commands with \ for readability. Each flag or flag+value pair gets its own continuation line, indented. Positional parameters go on the final line.
Good:
$ pipx install \
--suffix=@next \
--pip-args '\--pre' \
--force \
'libvcs'Bad:
$ pipx install --suffix=@next --pip-args '\--pre' --force 'libvcs'When stuck in debugging loops:
- Pause and acknowledge the loop
- Minimize to MVP: Remove all debugging cruft and experimental code
- Document the issue comprehensively for a fresh approach
- Format for portability (using quadruple backticks)