Sxian/clt 2919/integrate platform audio by xianshijing-lk · Pull Request #669 · livekit/python-sdks

xianshijing-lk · 2026-05-15T23:31:13Z

Summary

Adds PlatformAudio support to the Python SDK, enabling microphone capture via WebRTC's Audio Device Module (ADM) with built-in voice processing.

Changes

New: livekit-rtc/livekit/rtc/platform_audio.py

PlatformAudio - Main class for ADM access
- recording_devices() / playout_devices() - Device enumeration
- set_recording_device() / set_playout_device() - Device selection by GUID
- create_audio_source() - Create audio source with processing options
PlatformAudioSource - Audio source backed by ADM (no capture_frame() needed)
PlatformAudioOptions - Configure AEC, NS, AGC, prefer_hardware
AudioDeviceInfo - Device info (index, name, id/GUID)

Modified: livekit-rtc/livekit/rtc/track.py

LocalAudioTrack.create_audio_track() now accepts both AudioSource and PlatformAudioSource

Modified: livekit-rtc/livekit/rtc/init.py

Export: PlatformAudio, PlatformAudioSource, PlatformAudioOptions, PlatformAudioError, AudioDeviceInfo

Modified: livekit-rtc/livekit/rtc/room.py

Commented out ready_for_room_event (not yet in released livekit-ffi)

Modified: examples/basic_room.py

Complete rewrite demonstrating both audio modes:
- --platform-audio - Use ADM with voice processing (recommended)
- --file WAV_PATH - Publish WAV file via synthetic mode
- --list-devices - List available audio devices
- --mic-id / --speaker-id - Select specific devices
- --room - Room name
Demonstrates mixing both modes (microphone + file simultaneously)

New: examples/README.md

Documentation for all examples
Detailed explanation of PlatformAudio vs Synthetic mode

PlatformAudio vs Synthetic Mode
┌───────────────────────────────┬───────────────┬───────────────────────────────────────┐
│ Feature │ PlatformAudio │ Synthetic │
├───────────────────────────────┼───────────────┼───────────────────────────────────────┤
│ Voice processing (AEC/NS/AGC) │ Built-in │ Manual │
├───────────────────────────────┼───────────────┼───────────────────────────────────────┤
│ Raw frame access │ No │ Yes │
├───────────────────────────────┼───────────────┼───────────────────────────────────────┤
│ External audio libs needed │ No │ Yes │
├───────────────────────────────┼───────────────┼───────────────────────────────────────┤
│ Use case │ Voice calls │ Custom processing, TTS, file playback │
└───────────────────────────────┴───────────────┴───────────────────────────────────────┘
Both modes can run simultaneously (e.g., mic + background music).

Test Procedure

List audio devices

cd examples
python basic_room.py --list-devices
Expected: Lists available microphones and speakers with device IDs.

Connect with PlatformAudio

Start LiveKit server

livekit-server --dev

In another terminal

export LIVEKIT_URL=ws://localhost:7880
export LIVEKIT_API_KEY=devkey
export LIVEKIT_API_SECRET=secret

python basic_room.py --platform-audio --room test-room
Expected: Connects to room, publishes microphone track with voice processing.

Test with specific device

python basic_room.py --platform-audio --mic-id "" --room test-room
Expected: Uses specified microphone.

Test WAV file playback (synthetic mode)

python basic_room.py --file test.wav --room test-room
Expected: Publishes audio from WAV file.

Test mixed mode (PlatformAudio + file)

python basic_room.py --platform-audio --file test.wav --room test-room
Expected: Publishes two audio tracks - microphone and file.

Verify with second participant

Open https://meet.livekit.io and join the same room to verify audio is received.

- Update rust-sdks submodule to latest main (497527d4) - Regenerate protobuf files - Includes: Fix LocalTrackPublished handle leak, WebRTC ADM integration, and other improvements Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

cloudwebrtc · 2026-05-19T13:26:38Z

@copilot, please check the ruff format CI, and try to fix the format issue.

Agent-Logs-Url: https://github.com/livekit/python-sdks/sessions/df3730e3-6da1-4dbf-a628-b5f7d511886f Co-authored-by: cloudwebrtc <17733746+cloudwebrtc@users.noreply.github.com>

Copilot · 2026-05-19T13:41:30Z

@copilot, please check the ruff format CI, and try to fix the format issue.

Addressed in 356d5c0. I reproduced the Ruff workflow locally, removed the unused import blocking Ruff, applied ruff format to the files flagged by CI, and make check now passes.

CLAassistant · 2026-05-19T13:41:31Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ xianshijing-lk
❌ Copilot
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

xianshijing-lk requested review from cloudwebrtc and lukasIO as code owners May 15, 2026 23:31

This comment was marked as resolved.

Sign in to view

xianshijing-lk force-pushed the sxian/CLT-2919/integrate_platformAudio branch from f519dde to 52bed9f Compare May 15, 2026 23:39

xianshijing-lk changed the base branch from main to sxian/CLT-2919/update-livekit-ffi-0.12.57 May 15, 2026 23:46

Base automatically changed from sxian/CLT-2919/update-livekit-ffi-0.12.57 to main May 18, 2026 10:26

xianshijing-lk and others added 5 commits May 18, 2026 11:53

Update livekit-ffi to 0.12.57

a4d3a74

- Update rust-sdks submodule to latest main (497527d4) - Regenerate protobuf files - Includes: Fix LocalTrackPublished handle leak, WebRTC ADM integration, and other improvements Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

generated protobuf

30da99a

implement platform audio on python

5f3eca3

added unit tests for platformAudio

40e1e65

addressed the comment and fixed publish_mic.py and local_audio.py

2ef05f0

xianshijing-lk force-pushed the sxian/CLT-2919/integrate_platformAudio branch from 52bed9f to 2ef05f0 Compare May 18, 2026 18:53

updated the readme

f2d2da4

Copilot started work on behalf of cloudwebrtc May 19, 2026 13:37 View session

Fix ruff formatting issues

356d5c0

Agent-Logs-Url: https://github.com/livekit/python-sdks/sessions/df3730e3-6da1-4dbf-a628-b5f7d511886f Co-authored-by: cloudwebrtc <17733746+cloudwebrtc@users.noreply.github.com>

Copilot finished work on behalf of cloudwebrtc May 19, 2026 13:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sxian/clt 2919/integrate platform audio#669

Sxian/clt 2919/integrate platform audio#669
xianshijing-lk wants to merge 7 commits into
mainfrom
sxian/CLT-2919/integrate_platformAudio

xianshijing-lk commented May 15, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

cloudwebrtc commented May 19, 2026

Uh oh!

Copilot AI commented May 19, 2026

Uh oh!

CLAassistant commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

xianshijing-lk commented May 15, 2026

Start LiveKit server

In another terminal

Uh oh!

This comment was marked as resolved.

Uh oh!

cloudwebrtc commented May 19, 2026

Uh oh!

Copilot AI commented May 19, 2026

Uh oh!

CLAassistant commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants