Software Guide

This document provides a detailed user guide for Cyrene QwenTTS GUI, helping you quickly get started and make full use of the software's features.

Installation Guide

System Requirements

Operating System: Windows 10/11 (64-bit)
Processor: Intel Core i5-12400 or an equivalent processor, at minimum
Memory: At least 16GB RAM
Storage: At least 12GB free space (runtime environment ~5GB, default model file ~4GB)
Graphics Card (optional): CUDA-compatible graphics card
Network: Internet connection required on first use to download the runtime environment and models
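
The storage and operating system requirements above can be checked before installing. The following is a minimal sketch using only the Python standard library; it is not part of the software, and the function names `free_space_gb` and `check_system` are hypothetical. It does not check RAM or processor model, since the standard library offers no portable call for those.

```python
import platform
import shutil

MIN_FREE_GB = 12  # storage requirement listed above


def free_space_gb(path="."):
    """Return the free space on the drive containing `path`, in GiB."""
    return shutil.disk_usage(path).free / (1024 ** 3)


def check_system(install_dir="."):
    """Collect human-readable warnings; an empty list means nothing was flagged."""
    warnings = []
    if platform.system() != "Windows":
        warnings.append("Operating system is not Windows")
    # Heuristic only: 64-bit machine names such as AMD64 or ARM64 end in "64"
    if not platform.machine().endswith("64"):
        warnings.append("Machine type does not look 64-bit")
    if free_space_gb(install_dir) < MIN_FREE_GB:
        warnings.append(f"Less than {MIN_FREE_GB}GB free at {install_dir!r}")
    return warnings


if __name__ == "__main__":
    for line in check_system() or ["No problems found"]:
        print(line)
```

Pass the intended installation folder to `check_system` so that the free-space check targets the right drive.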

Installation Steps

Download Program
- Download either the latest integrated package, or the executable file together with requirements.txt, from the GitHub Releases page
- Select the version suitable for your system (Windows 64-bit)
Extract/Prepare Environment
- If you downloaded the integrated package, extract it directly to the target folder
- If you downloaded the executable file and requirements.txt, place them in the same folder

First Launch

When launching the software for the first time (non-integrated package), the system will:

Initialize Configuration: Create default configuration files
Install Environment: Install the necessary Python libraries (those listed in requirements.txt plus those built into the program)
Download Models: Download the default Qwen-TTS model
Welcome Wizard: Display a welcome wizard to guide you through initial setup

Why aren't the libraries in requirements.txt built into the program?

The libraries built into the program are tested and verified, so they ensure that the software runs normally. The libraries in requirements.txt vary with each user's needs and environment, so they are kept in a separate file that you can customize, for example to install a specific version of a library.
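
To illustrate what customizing requirements.txt can look like, the sketch below parses requirements-style text and reports which entries carry a version constraint. It is a simplified illustration, not the program's own installer logic: the function name `parse_requirements` is hypothetical, the sample package entries are made up for the example, and extras or environment markers are not handled.

```python
def parse_requirements(text):
    """Return (name, specifier) pairs from requirements.txt-style text."""
    entries = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or line.startswith("-"):  # skip blank lines and pip options
            continue
        # Split at the first version-operator character, if there is one
        for i, ch in enumerate(line):
            if ch in "=<>!~":
                entries.append((line[:i].strip(), line[i:].strip()))
                break
        else:
            entries.append((line, ""))  # no version constraint given
    return entries


# Illustrative entries only, not the program's actual library list
sample = """
numpy==1.26.4
soundfile>=0.12
requests
"""

for name, spec in parse_requirements(sample):
    print(name, spec or "(any version)")
```

An entry such as `numpy==1.26.4` pins an exact version, while a bare name lets the installer pick any compatible release.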

Basic Usage

Voice Generation (Model Qwen3-...-CustomVoice)

Enter Text
- Enter the text to be converted in the text input box in the central workspace
- You can enter multi-line text, and the system will process it automatically
Select Model
- Select an appropriate model in the "Model Selection" section of the left panel
- For best results, use "Qwen3-...-1.7B-CustomVoice"; if your device performance is limited, use the 0.6B model instead
Select Speaker
- Select an appropriate speaker in the "Speaker Selection" section of the left panel
Generate Voice
- Click the "Generate Audio" button
- Wait for the system to complete processing (processing time depends on text length and hardware performance)
Preview and Save
- Click the "Play" button to preview the generated voice
- When satisfied, click the "Save" button to save as an audio file

Voice Design (Model Qwen3-...-VoiceDesign)

Enter Voice Description
- For example: A young female, sounds happy
Enter Text
- Enter the text to be generated in the text input box
Generate Voice
- Click the "Start Generation" button
- Wait for the system to complete processing (processing time depends on text length and hardware performance)
Preview and Save
- Click the "Play" button to preview the generated voice
- When satisfied, click the "Save" button to save as an audio file

Voice Cloning (Model Qwen3-...-Base)

Switch to Voice Cloning Interface
- Click the "Voice Clone" tab in the top navigation bar
Upload Reference Audio
- Click the "Browse" button
- Select an audio file containing clear speech (5-10 seconds or longer is recommended)
- Optionally, enter the transcript of the reference audio
Enter Text
- Enter the text to be converted in the text input box
Generate Cloned Voice
- Click the "Clone" button
- The system will generate a voice imitating the style of the reference audio (processing time depends on text length and hardware performance)
Preview and Save
- After generation is complete, the program will automatically play the cloned voice
- You can click the "Play" button to preview the generated voice again, or drag the progress bar to adjust the playback position
- When satisfied, click the "Save" button to save as an audio file (if you forget to save, the program automatically saves the audio to the default path /outputs)
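
The recommended reference length of 5-10 seconds or longer can be verified before uploading. The following is a minimal sketch that assumes the reference clip is an uncompressed PCM WAV file (this guide does not state which input formats the software accepts); the function names are hypothetical and only Python's standard `wave` module is used.

```python
import wave

RECOMMENDED_MIN_SECONDS = 5.0  # lower end of the 5-10 second recommendation


def wav_duration_seconds(path):
    """Return the duration of an uncompressed PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_long_enough(path, minimum=RECOMMENDED_MIN_SECONDS):
    """True if the clip meets the recommended minimum length."""
    return wav_duration_seconds(path) >= minimum
```

For example, `is_long_enough("reference.wav")` (a hypothetical file name) returns False for a 3-second clip, which suggests recording a longer sample.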

Audio Browser

Switch to Audio Browser Interface
- Click the "Audio Browser" tab in the left navigation bar
View Audio List
- All generated audio files will be displayed in the list
Play Audio
- Double-click the audio file in the list to play it
- You can use the progress bar to adjust the playback position
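
Generated files can also be inspected outside the program. The sketch below lists WAV files newest first, assuming they are stored in an outputs folder next to the program (the default save path mentioned under Voice Cloning) and use the .wav extension. The function name `list_generated_audio` is hypothetical and does not reflect how the Audio Browser itself is implemented.

```python
from pathlib import Path


def list_generated_audio(output_dir="outputs"):
    """Return .wav files in output_dir, newest first (empty if the folder is missing)."""
    folder = Path(output_dir)
    if not folder.is_dir():
        return []
    files = [p for p in folder.iterdir() if p.is_file() and p.suffix.lower() == ".wav"]
    # Sort by modification time so the most recent generation comes first
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


if __name__ == "__main__":
    for audio in list_generated_audio():
        print(audio.name)
```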

Advanced Features

Voice Presets

The software provides multiple voice presets to help you quickly apply specific voice styles:

Default: Standard voice style
Sweet: Sweet and cute voice style
Mature: Mature and steady voice style
Professional: Professional broadcasting voice style
Friendly: Friendly and natural voice style
Passionate: Passionate voice style

Custom Presets

You can create and save your own voice presets:

Adjust Parameters: Adjust voice parameters to your satisfaction
Save Preset: Click the "Save Preset" button
Name Preset: Enter a name for your preset
Apply Preset: Select your saved preset from the preset list

Troubleshooting

Common Issues

Model Download Failed
- Cause: Network connection issues or model server temporarily unavailable
- Solution: Check network connection and try again later
Voice Generation Failed
- Cause: Text too long or model loading failed
- Solution: Shorten text length, or reload the model
Poor Voice Quality
- Cause: Inappropriate model selection
- Solution: Try a model with more parameters (1.7B)
Software Crash
- Cause: Insufficient system performance/resources
- Solution: Close background programs and restart the software, or run the software on a more capable computer

Contact Support

If you encounter problems that cannot be resolved, you can contact support through the following methods:

GitHub Issues: Submit an Issue in the GitHub repository

Performance Optimization

Hardware Optimization

Use GPU Acceleration: An NVIDIA graphics card is recommended
Increase Memory: 16GB is the minimum; more memory is recommended for processing long text
Use SSD: Install the software and models on an SSD to improve loading speed. If you don't have an SSD but have more than 32GB of memory, you can instead install to a RamDisk so that the software is stored in memory. In that case, remember to handle persistence yourself by copying the files from the RamDisk to a hard disk before shutdown.
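
If you use a RamDisk, the copy-back step before shutdown can be scripted. The following is a minimal sketch using the standard library; the function name `persist_ramdisk` and the example paths are hypothetical, so substitute your own RamDisk and backup locations.

```python
import shutil
from pathlib import Path


def persist_ramdisk(ramdisk_dir, backup_dir):
    """Copy everything from ramdisk_dir into backup_dir, overwriting older copies."""
    src = Path(ramdisk_dir)
    if not src.is_dir():
        raise FileNotFoundError(f"RamDisk folder not found: {src}")
    # dirs_exist_ok=True (Python 3.8+) lets repeated backups reuse the same target
    shutil.copytree(src, backup_dir, dirs_exist_ok=True)
    return Path(backup_dir)


# Example call with hypothetical paths:
# persist_ramdisk(r"R:\CyreneQwenTTS", r"D:\Backup\CyreneQwenTTS")
```

Note that this copies files but never deletes anything from the backup, so files removed from the RamDisk remain in the backup folder.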

Text Processing Optimization

Segment Processing: For long text, it is recommended to process it in segments for better results
Avoid Complex Formats: Try to use simple text formats and avoid excessive special symbols
Use Punctuation Properly: Use punctuation marks appropriately to get more natural pauses
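
Segment processing can be done by hand, or with a small helper such as the sketch below, which splits long text at sentence-ending punctuation. The function name `split_into_segments` and the default `max_chars=200` are assumptions made for illustration; this guide does not state an actual length limit for the software.

```python
import re


def split_into_segments(text, max_chars=200):
    """Group sentences into segments of at most max_chars characters where possible.

    A single sentence longer than max_chars is kept whole rather than cut mid-sentence.
    """
    # Split after ASCII marks followed by whitespace, or directly after CJK marks
    pattern = r"(?<=[.!?])\s+|(?<=[。！？])"
    sentences = [s.strip() for s in re.split(pattern, text) if s.strip()]
    segments, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            segments.append(current)  # current segment is full, start a new one
            current = sentence
        else:
            current = candidate
    if current:
        segments.append(current)
    return segments
```

Each returned segment can then be pasted into the text input box and generated separately. Because splits happen only at sentence boundaries, pauses stay natural.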

Advanced Configuration

Frequently Asked Questions

Q: Does the software require an internet connection?

A: An internet connection is required on first use to download the environment and models; after that, the software can run offline. (Note: The integrated package does not include models.)

Q: Can the generated voice be used for commercial purposes?

A: Please refer to the Qwen-TTS model license agreement and relevant laws and regulations.

Q: What audio output formats are supported by the software?

A: WAV format is supported.

Q: How do I uninstall the software?

A: Simply delete the directory where the software is located.

Copyright Information

Cyrene QwenTTS GUI

Author: Cyrene2008
UI Design: Cyrene2008
Version: v0.1.0
License: GPLv3 + Additional Statement (see https://github.com/Cyrene2008/Cyrene-QwenTTS-GUI/blob/main/LICENSE for details)
Project Address: https://github.com/Cyrene2008/Cyrene-QwenTTS-GUI

This software is based on the following open source projects:

Qwen-TTS: https://github.com/QwenLM/Qwen-TTS
PySide6: https://wiki.qt.io/Qt_for_Python
FluentUI: https://github.com/microsoft/fluentui

Disclaimer

This software is for personal learning and research use only
Please comply with relevant laws and regulations and do not use it for illegal purposes
The author is not responsible for any consequences arising from the use of this software
The voice models available in the software may have certain limitations; support for loading other models may be added in the future

Contributors

星海昔涟

Changelog

2/17/26, 1:34 PM
