
Creating a Fully Artificial Voicebank

WIP

00

01

  • Audacity: used for most of the editing
  • Adobe Audition: post-editing, such as removing clicks and clipping
  • Paint.wav: generates noise based on drawings
  • Vocalizer: makes your vowels sound more like vowels
  • Vocoder: does the same thing as Vocalizer

02

For the audio part of Fermi's voicebank, I was inspired by Curiosity, the singing rover, who sounds like this:
[audio clip, 0:08]
Out of all the basic synthetic waveforms, square waves sounded the most similar. I created the vowels in Audacity by stacking square waves at different frequencies and volumes (somewhere around the 5~10 minute mark).
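
If you want to play with the same idea outside Audacity, here's a minimal sketch of the square-wave stacking in Python. The frequencies, volumes, and output filename are placeholder values, not the ones I actually used for Fermi:

```python
import numpy as np
from scipy.io import wavfile
from scipy.signal import square

SR = 44100                                        # sample rate in Hz
DUR = 2.0                                         # length in seconds
t = np.linspace(0, DUR, int(SR * DUR), endpoint=False)

# (frequency in Hz, relative volume) -- placeholder values, not Fermi's real ones
partials = [(220, 1.0), (660, 0.5), (1100, 0.35), (2640, 0.15)]

tone = np.zeros_like(t)
for freq, vol in partials:
    tone += vol * square(2 * np.pi * freq * t)    # stack the square waves

tone /= np.max(np.abs(tone))                      # normalize so it doesn't clip
wavfile.write("vowel_sketch.wav", SR, (tone * 32767).astype(np.int16))
```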

Then, using the Plot Spectrum tool, I analyzed the frequencies of my own pronunciations and used the Filter Curve EQ to make each vowel sound more like "me".
If needed, I also used a little bit of Vocalizer.
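
The Plot Spectrum step can be approximated in code too. This is a rough sketch, not the exact Audacity analysis: it reads a recording of a spoken vowel ("my_vowel_a.wav" is a made-up filename) and prints the strongest peaks, which is basically the information I was eyeballing:

```python
import numpy as np
from scipy.io import wavfile

sr, data = wavfile.read("my_vowel_a.wav")         # hypothetical filename
data = data.astype(np.float64)
if data.ndim > 1:                                 # mix stereo down to mono
    data = data.mean(axis=1)

# windowed FFT of the recording
spectrum = np.abs(np.fft.rfft(data * np.hanning(len(data))))
freqs = np.fft.rfftfreq(len(data), d=1 / sr)

# list the loudest bins below 5 kHz as a crude "where are my formants" readout
low_f = freqs[freqs < 5000]
low_s = spectrum[: len(low_f)]
for i in np.argsort(low_s)[-10:][::-1]:
    print(f"{low_f[i]:7.1f} Hz  {20 * np.log10(low_s[i] + 1e-12):6.1f} dB")
```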

That ended up with this result:
[audio clip, 0:04]
The voice came out pretty deep and didn't fit well with what I wanted for Fermi (which I envisioned as a really high-pitched, sharp, "cutesy" voice).

I started fiddling with ways to make the voice sound "cuter", which usually means raising the pitch in an audio editor. However, I found that when I did that, the samples wouldn't render properly in UTAU. I didn't really feel like digging into exactly why this happens, but the simple solution that worked for me was to just use the g flag in UTAU and render the audio out from there.
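
For anyone curious, the kind of pitch raise I'm talking about is roughly this resampling trick, sketched below. It's not exactly what Audacity does under the hood, and the 1.3 ratio and filenames are just example values, not my actual settings:

```python
import numpy as np
from scipy.io import wavfile
from scipy.signal import resample

ratio = 1.3                                       # >1 raises the pitch (example value)
sr, data = wavfile.read("fermi_a.wav")            # hypothetical filename
data = data.astype(np.float64)

# resample to fewer frames, then play back at the original rate:
# the sample comes out higher (and also shorter/faster)
shifted = resample(data, int(len(data) / ratio))
shifted /= np.max(np.abs(shifted))
wavfile.write("fermi_a_high.wav", sr, (shifted * 32767).astype(np.int16))
```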

After throwing Fermi's vowels through tn_fnds at around g-30, she sounded a lot better:
[audio clip, 0:05]