Instructor lane
The clear director voice: task framing, guidance, radio-host control, and scene-setting.
Not just a voice clone. Not a tokenizer. Inner Station is a three-lane OmniVoice architecture for turning direction, inner thought, and final spoken identity into one living MindExpander voice system.
The clear director voice: task framing, guidance, radio-host control, and scene-setting.
The inner station: dream reasoning, soft cognition, liminal monologue, and reflective texture.
The final public voice identity: the lane that carries the actual MindExpander sound and presence.
This public page documents the project and architecture. The heavy/raw media stays private. The canonical model identity is OmniVoice_MindExpander_FULL_TRAIN, and the active dataset is an OmniVoice-generated three-lane voice/persona package.
{
"audio": "mindexpander_output_full/audio/mindbot_000000_mindexpander_output.wav",
"text": "spoken transcript for the MindExpander output voice",
"duration": 3.5,
"dataset_id": 2
}
The public framework: name, architecture, lane contract, endpoint shape, demos, and docs. This is what people can understand and fork.
The actual fine-tune signal: audio paths, transcripts, durations, and dataset_id lane labels. This stays backed up and private unless approved.
First train the MindExpander output voice, then test role-aware multi-lane training for instructor, whisper, and output modes.
Generate the full OmniVoice three-lane dataset and verify audio quality.
Upload the private Hugging Face dataset with clone-portable manifests and receipts.
Run the special unified three-lane training: all lanes together, balanced, with explicit Inner Station role tags so the model learns instructor, whisper, and output modes instead of blending them.
Wire the voice into realtime agents, radio shows, command centers, and old-timey cyber-oracle broadcasts.