Open API

MediCut Audio/Video Integration API

End-to-end AI video, multi-track audio & AI music as an open service

API Capability Overview

MediCut Audio/Video Integration API is an end-to-end open service built for enterprise developers, partner platforms, and product teams. It combines intelligent audio analysis, AI video editing, multi-track timeline creation, universal media automation, and batch compute orchestration. The API stack is layered and production-oriented, supporting both direct end-user usage and enterprise integrations with async tasks, idempotency, quota billing, and permission control. It can be used across short-video creation, audio post-production, creator workflows, media asset processing, and automated commercial media pipelines to quickly enable AI media capabilities.

Core Advantages

(1) Deep AI audio analysis with precise low-level feature understanding

Supports AI analysis of single or batched audio files across metadata, rhythm and style, BPM, chords, vocals, instruments, energy curve, timbre, and production characteristics. It also provides stem separation into vocals, drums, bass, piano, accompaniment, and more, with custom accompaniment generation and result export for analysis, reconstruction, asset decomposition, and quality control.

(2) One-stop AI video creation and editing for end-to-end workflows

Integrates audio extraction, smart BGM replacement, and fine-grained multi-track timeline editing. Supports vocal separation, vocal-removed video composition, multi-segment audio mixing, fades, and gain control. Built-in F7-style timeline tools enable layered tracks, clip editing, track mute, and custom render duration for professional-grade post-production.
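To make the mixing terms above concrete, here is a minimal sketch of gain control, linear fades, and multi-segment mixing on raw sample lists. It is not the service's implementation: the function names, the linear fade shape, and the plain-list sample format are all assumptions chosen for illustration.

```python
def db_to_linear(gain_db: float) -> float:
    """Convert a gain in decibels to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def apply_gain_and_fades(samples, sample_rate, gain_db=0.0,
                         fade_in_s=0.0, fade_out_s=0.0):
    """Scale a clip by gain_db and apply linear fade-in / fade-out ramps.

    Hypothetical helper: the fade curve used by the real service is not
    documented in the source, so a linear ramp is assumed here.
    """
    n = len(samples)
    gain = db_to_linear(gain_db)
    fade_in_n = min(n, int(fade_in_s * sample_rate))
    fade_out_n = min(n, int(fade_out_s * sample_rate))
    out = []
    for i, s in enumerate(samples):
        env = 1.0
        if fade_in_n and i < fade_in_n:
            env = min(env, i / fade_in_n)            # ramp up from 0
        if fade_out_n and i >= n - fade_out_n:
            env = min(env, (n - 1 - i) / fade_out_n)  # ramp down to 0
        out.append(s * gain * env)
    return out


def mix(tracks):
    """Sum equal-rate tracks sample by sample; shorter tracks end in silence."""
    length = max((len(t) for t in tracks), default=0)
    return [sum(t[i] for t in tracks if i < len(t)) for i in range(length)]
```

A typical use would be to lower a background-music clip by a few decibels, fade its edges, and then pass it to `mix` together with the voice track.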

(3) Lightweight yet comprehensive universal media automation

Built on a unified media task pipeline with broad capability coverage: super-resolution enhancement, face and license-plate privacy masking, format conversion, AI denoise, clipping and stitching, watermark add/remove, subtitle burn-in, beauty filters, picture-in-picture, A/V sync, speed ramping, and cover generation. Supports both single-file processing and batch scheduling to reduce manual editing costs.

(4) Enterprise-grade Open API designed for commercial integration

Provides an independent commercial API system aligned with end-user capabilities, with API Key-based authentication. Supports custom idempotency, async callbacks, quota management, rate limiting, detailed billing reconciliation, robust error handling, and task-state observability. Easy to integrate into third-party platforms, SaaS systems, and smart devices.
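As an illustration of how API Key authentication, a custom idempotency key, and an async callback might fit into one task submission, the sketch below assembles a request without sending it. The source does not document endpoints, header names, or body fields, so the host, the `X-API-Key` and `Idempotency-Key` headers, the `/tasks` path, and the `callback_url` field are all hypothetical placeholders.

```python
import hashlib
import json

# Hypothetical values: none of these names come from the MediCut documentation.
BASE_URL = "https://api.example.com/v1"   # placeholder host
API_KEY_HEADER = "X-API-Key"              # assumed header name
IDEMPOTENCY_HEADER = "Idempotency-Key"    # assumed header name


def idempotency_key(task_type: str, business_id: str) -> str:
    """Derive a stable key so retries of the same logical job share one key."""
    raw = f"{task_type}:{business_id}".encode("utf-8")
    return hashlib.sha256(raw).hexdigest()


def build_task_request(api_key, task_type, params, callback_url, business_id):
    """Assemble (but do not send) an async task submission."""
    body = {"type": task_type, "params": params, "callback_url": callback_url}
    return {
        "method": "POST",
        "url": f"{BASE_URL}/tasks",
        "headers": {
            API_KEY_HEADER: api_key,
            IDEMPOTENCY_HEADER: idempotency_key(task_type, business_id),
            "Content-Type": "application/json",
        },
        # sort_keys keeps the serialized body identical across retries
        "body": json.dumps(body, sort_keys=True),
    }
```

Deriving the key from a caller-side business identifier, rather than generating a random value per attempt, means a retry after a timeout carries the same key as the original attempt.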

(5) Stable and reliable engineering service architecture

Task interfaces support idempotent duplicate-prevention to avoid repeated billing or execution. Async callbacks ensure near real-time task synchronization. Unified response schemas, standardized error codes, and package quota validation improve production reliability. Supports mainstream media formats, large files, and custom output resolution/bitrate/framerate.
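The duplicate-prevention behaviour described above can be modelled with a small in-memory registry. This is a toy model under stated assumptions, not the service's code: the source does not say whether a repeated key returns the original task or an error, so returning the original task is assumed here, and `TaskRegistry`, `QuotaExceeded`, and the one-unit-per-task quota are hypothetical.

```python
import itertools
from dataclasses import dataclass, field


class QuotaExceeded(Exception):
    """Raised when the package quota cannot cover a new task."""


@dataclass
class TaskRegistry:
    """Toy model: one quota unit per new task, none for repeated keys."""
    quota: int
    _tasks: dict = field(default_factory=dict)
    _ids: itertools.count = field(default_factory=lambda: itertools.count(1))

    def submit(self, key: str, payload: dict) -> dict:
        existing = self._tasks.get(key)
        if existing is not None:
            # Same idempotency key: no new execution, no new charge.
            return existing
        if self.quota <= 0:
            raise QuotaExceeded("package quota exhausted")
        self.quota -= 1
        task = {
            "task_id": f"task-{next(self._ids)}",
            "status": "queued",
            "payload": payload,
        }
        self._tasks[key] = task
        return task
```

Submitting twice with the same key yields the same task record and consumes one quota unit, which is the property that prevents repeated billing on client retries.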

(6) In-house top-tier AI algorithms for lossless one-pass output

Powered by a custom Kaiser-sinc interpolation algorithm tuned across diverse audio samples for high-fidelity band-limited reconstruction with strong stopband suppression. A one-read/one-write pipeline decodes the source once and encodes the result once, which avoids the cumulative quality loss of repeated intermediate passes, improves export efficiency, keeps output quality consistent across the whole pipeline, and minimizes degradation introduced by external dependencies.
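For readers unfamiliar with the technique, the following is a textbook sketch of Kaiser-windowed sinc interpolation, which estimates a band-limited signal at a fractional sample position. It is not the tuned in-house algorithm: the kernel half-width, the `beta` value, and all function names are assumptions picked for illustration.

```python
import math


def bessel_i0(x: float) -> float:
    """Zeroth-order modified Bessel function of the first kind (power series)."""
    total, term, k = 1.0, 1.0, 1
    while term > 1e-12 * total:
        term *= (x / (2.0 * k)) ** 2
        total += term
        k += 1
    return total


def kaiser_sinc(t: float, half_width: int, beta: float) -> float:
    """Kaiser-windowed sinc kernel evaluated at offset t (in samples)."""
    if abs(t) >= half_width:
        return 0.0
    sinc = 1.0 if t == 0.0 else math.sin(math.pi * t) / (math.pi * t)
    window = bessel_i0(beta * math.sqrt(1.0 - (t / half_width) ** 2)) / bessel_i0(beta)
    return sinc * window


def interpolate(samples, position, half_width=8, beta=8.6):
    """Estimate the signal value at a fractional sample position.

    half_width and beta are illustrative defaults, not published parameters.
    Samples outside the input range are treated as silence.
    """
    center = math.floor(position)
    acc = 0.0
    for n in range(center - half_width + 1, center + half_width + 1):
        if 0 <= n < len(samples):
            acc += samples[n] * kaiser_sinc(position - n, half_width, beta)
    return acc
```

A larger `beta` trades a wider transition band for deeper stopband attenuation, and a larger `half_width` narrows the transition band at the cost of more multiplications per output sample.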

Use Cases

Short-video creators: remove vocals for backing tracks, extract vocals for subtitles, clean noise for secondary edits, and combine voice effects for viral content at scale.

Film/pro editors: high-precision stem separation for vocals and instruments, suitable for end-to-end post-production and long-form source material.

Live streaming/MCN: extract vocals from replays, remove background music, and quickly repurpose long streams into monetizable derivative content.

Online education: enhance instructor voices, remove background noise, and improve listening clarity for long recorded lessons.

Enterprise office: extract speech from meeting recordings, denoise and archive content, and support SaaS-based collaboration and review.

Music creators/self-media: precise stem separation plus AI music and sound effect workflows for original and adapted productions with high audio fidelity.

Full API Capability List

| Feature | Description | Category |
|---|---|---|
| Video size settings | Supports up to 8K output with custom resolution, bitrate, aspect ratio, and frame rate. | Core |
| Multi-segment mixed editing | Import multiple videos and photos for mixed editing workflows. | Core |
| Keyframes and curves | Add keyframes and curve animation to position, effects, volume, and other parameters. | Core |
| Basic video editing | Precise trimming, multi-clip stitching, and transitions for lightweight editing. | Core |
| Speed control | Global/segment speed changes with curve control, up to 100x while preserving pitch. | Core |
| Crop and rotate | Aspect crop, angle rotation, and mirroring for platform-specific framing. | Core |
| A/V sync correction | Detect and correct desync or delay with custom offset controls. | Core |
| Flexible effects | Apply filters, subtitles, transitions, and audio effects across timeline dimensions. | Core |
| Node-based effects | Compose multi-layer effects in node-style structures for complex rendering. | Core |
| Video quality enhancement | Deblocking, denoise, super-resolution, and restoration for low-quality footage. | Core |
| Video privacy masking | Face/plate auto-masking, region masking, and blur for privacy compliance. | Core |
| External effect API | Texture/effect integration interfaces for third-party effect pipelines. | Core |
| Batch media processing | Batch edit, transcode, beautify, and denoise media assets efficiently. | Core |
| Custom export | Custom clips, quality presets, and encoder parameters for tailored output. | Core |
| Regional effects | Apply visual effects to custom local regions in the frame. | Filter |
| Image parameter tuning | Fine-grained control of brightness, contrast, saturation, highlights, shadows, etc. | Image |
| Picture in picture | Dual-layer composition with custom sub-video size, position, and animation. | Image |
| Add watermark | Text/image watermark with custom position, opacity, and animation. | Image |
| Remove watermark | Intelligent repair algorithms for logo/text removal and cleaner visuals. | Image |
| One-click mute | Remove all audio while preserving original video visuals. | Image |
| Cover settings | Capture frame or upload custom image as cover with quality optimization. | Image |
| Voice-over | Independent multi-segment voice-over recording and arrangement. | Audio |
| Independent volume control | Separate gain controls for original sound, BGM, and voice-over tracks. | Audio |
| Extract audio from video | One-click audio extraction with multi-track choice and format options. | Audio |
| Replace background music | Swap original BGM while preserving vocals for short-video remixing. | Audio |
| A/V stem separation | Separate vocal and accompaniment tracks for music re-creation. | Audio |
| Music trimming | Trim specific audio segments for soundtrack usage. | Audio |
| AI audio denoise | Remove environmental noise/reverb and improve clarity intelligently. | Audio |
| Multi-track music mixing | Overlay multiple audio tracks with support for mainstream formats. | Audio |
| Fade in/out | Smooth transitions at clip edges for better listening quality. | Audio |
| Voice effects | Built-in pitch/voice transformation presets for creative audio. | Audio |
| Audio equalizer | Independent low/mid/high-band tuning for precise sound shaping. | Audio |
| Music in/out points | Control start/end playback points for each music segment. | Audio |
| Audio parameter adjustment | Gain, loudness normalization, and channel switching controls. | Audio |
| Multi-segment subtitles | Batch subtitle segments with custom display time ranges. | Subtitle |
| Subtitle position | Free subtitle placement on screen. | Subtitle |
| Text size | Continuously adjustable text size for different resolutions. | Subtitle |
| Subtitle rotation | Custom rotation angles for stylized text layouts. | Subtitle |
| Subtitle color | Custom text color and transparency settings. | Subtitle |
| Subtitle font | Import external fonts for richer typography. | Subtitle |
| Subtitle alignment | Left/center/right alignment for multiline subtitles. | Subtitle |
| Text style processing | Bold, italic, shadow, underline, and other base styles. | Subtitle |
| Text spacing | Control letter spacing and line spacing for readability. | Subtitle |
| Subtitle stroke | Custom stroke color, width, and opacity for legibility. | Subtitle |
| Horizontal/vertical text | Switch subtitle orientation for different aesthetics. | Subtitle |
| Subtitle mask | Use subtitle regions as layer masks for creative effects. | Subtitle |
| Preset style pack | Apply combined subtitle effect styles in one click. | Subtitle |
| Per-character animation | Animate individual characters with multiple motion patterns. | Subtitle |
| Background animation | Decorative motion effects for subtitle backgrounds. | Subtitle |
| Border subtitle | Dynamic borders for chat bubble and dialogue scenes. | Subtitle |
| Karaoke subtitle | Adaptive subtitle backgrounds with karaoke-style color changes. | Subtitle |
| Property composition effects | Combine text attributes for custom visual results. | Subtitle |
| Text animation | In/out/combined text animation presets. | Subtitle |
| Text bubbles | Static/dynamic bubble backgrounds for commentary videos. | Subtitle |
