HOME | DD

Cabbagesaurus — OTO TUTORIAL
Published: 2016-08-20 15:36:51 +0000 UTC; Views: 36552; Favourites: 133; Downloads: 0
Redirect to original
Description body div#devskin0 hr { }

PLEASE READ!
So you've read my recording tutorial (hopefully, and thanks for doing so ), recorded your bank and imported it into UTAU. You've found the UST to the first song you want to cover, but what is this? No sound plays, or really strange static or a lot of other potato-ness. Oh my-
After a few struggles and some googling, you find that you haven't oto'd your bank. What?
You find that it is a form of programming of syllables- and that experienced oto-ers can be paid to do your work. Crap- you've only got 10 points... (Been there, done that)

Have no fear! This is what this tutorial is for.
This tutorial will ensure that your UTAU will sing smoothly and wonderfully between notes. Whether the notes themselves is wrong or the tuning not fine enough is related to the UST (guide for that MIGHT come up if I ever learn to tune well)

So prepare yourself for the long and tedious work that is OTO-ing. 
We'll be covering CV and VCV, since they're the most common. CVVC I have no idea how to oto, but I will provide a link to a guide in the final notes (that we were intending to use for a potential CVVC bank but things got too confusing/we were lazy). 

So here is a table of contents:
1. What IS oto-ing? What's all those lines and numbers?2. CV otoing3. VCV otoing4. Final notes 


1. What IS oto-ing? What's all those lines and numbers?Simply put, otoing is programming UTAU to tell them 'this is a consonant, this is a vowel, this is how long the sample is, please adjust it accordingly'. 
Let's go over the details now.
To open the relevant window,



And this opens....





You'll learn to become VERY familiar with this box. (I hate opening this box...)
Now on the right hand side you can see:
1. Name of the sample (sample is named as this in your folder)
2. Alias (another way to name the sample. You can see that this is a breath sound, and any note named 'ahh' or 'a R' will play this sound in the UST)
3. Offset - Silences any sound in this area. It is a purple/blue section.
4. Consonant - A pink section that tells UTAU what NOT to stretch. It should have the consonant and the vowel until it stabilizes.
5. Cutoff - Silences any sound in this area. It is a purple/blue section AT THE END (similar to offset just at the end instead of the beginning of the sound)
6. Preutterance - Red line that marks where the consonant ends and the vowel begins. 
7. Overlap  - Green line that marks the area that can be overlapped with the previous syllable. 

Selecting one of the samples in the box and clicking "Launch Editor" will bring you to this:


The coloured objects mentioned before in the list can be click+dragged
The three buttons on the bottom right do the following:
Up - Go to the sound sample above
Down - Go to the sound sample below (as ordered on the table in the Voice Configurations)
Close - Closes this editing box

REMEMBER TO CLICK OK TO SAVE YOUR SETTINGS, OR SET IF YOU MANUALLY EDIT THE NUMBERS IN THE BOX IN THE VOICE CONFIGURATION.


2. CV otoingThere are several decent guides out there for doing this actually, so I guess this is kind of... not new information.
The sound samples are split into three different types: Vowels, Hard Consonants, Soft Consonants
I'll explain how to oto each of them one by one.

BEFORE YOU BEGIN OTO-ING PLEASE DOWNLOAD ONE OF THESE TWO FILES AND USE THIS TO REPLACE THE OTO FILE IN YOUR VOICEBANK FOLDER.

- SAMPLE FILE NAMES ARE IN HIRAGANA: Romaji alias  (Note that these are numbers because this was copy-pasted from my UTAU's CV bank. Just edit the sliders. If you get a bunch of symbols, change to Japanese locale or use Applocale to open etcetc.)
- SAMPLE FILE NAMES ARE IN ENGLISH: Hiragana alias  

What are these you ask? These are oto files that have the aliases already done for you so you don't have to do them one by one on your own! Just edit them about. 

Vowels (and n)a/i/u/e/o + n



- Drag the offset (blue) till the wave has become stable. This is because for vowels, some of them are prone to having a little wonky sounds at the start (look carefully at the start of the blue wave. Can you see that it is a little different to, lets say the middle of the wave?), so you want to cut this out to ensure consistency. (Note: I did kind of cut off an excessive amount here. Anything from about 0.45+ is OK.)
- Drag the consonant (pink) over by about 0.1-0.2 seconds. (Or till the vowel has become stable)
- Leave the preutterance (red line) at 0
- Move the overlap (green) to halfway through the consonant (pink)

Then scroll to the end of the wave.


- Drag the cutoff (blue) to where the wave is still stable. This is to eliminate any fade outs/changes in voice as you finish the recording. 

Hard Consonantsb/ch/d/g/j/k/p/t



- Drag the offset (blue) to the start of the wave. (Note: If the consonant is excessively long, feel free to cut it down. It should only be maximum 0.1 seconds long.)
- Drag the consonant (pink) over by about 0.1-0.3 seconds. (Or till the vowel has become stable)
- Move the preutterance (red line) in between the consonant and the vowel. (This is usually where the wave changes pattern)
- Leave the overlap (green) at 0 (or move it back to negative a little) (Note: I prefer leaving it at 0)



Then scroll to the end of the wave.
- Drag the cutoff (blue) to where the wave is still stable. This is to eliminate any fade outs/changes in voice as you finish the recording. 

Soft Consonantsf/h/l/m/n/r/s/v/w/y/z



- Drag the offset (blue) to the start of the wave. (Note: If the consonant is excessively long, feel free to cut it down. It should only be maximum 0.1 seconds long.)
- Drag the consonant (pink) over by about 0.1-0.3 seconds. (Or till the vowel has become stable)
- Move the preutterance (red line) in between the consonant and the vowel. (This is usually where the wave changes pattern)
- Move the overlap (green) to halfway in the consonant.




Then scroll to the end of the wave.
- Drag the cutoff (blue) to where the wave is still stable. This is to eliminate any fade outs/changes in voice as you finish the recording. 

Help! I can't see where the consonant ends and the vowel begins.Have no fear! Amazing spectrum generator is here!


Clicking this circled button will make this happen to your box...




If yours comes up a bit faded and difficult to see the patterns, click the asterisk box that has now appeared next to the 's' box you clicked before
Then drag the slider up. This will improve the contrast.
Now, can you see the changes in patterns?
Vowels are usually several parallel white/slightly blue lines in the centre.
Consonants are usually a little more fragmented, like a cloud of light/dark blue pixels.

Keep at this for EVERY SINGLE SYLLABLE and you'll have yourself a nifty bank!
CONGRATULATIONS, YOU'VE NOW OTO'D YOUR CV VB AND IT IS NOW READY TO SING! Hope to see some lovely covers from you soon! -v-)/

3. VCV otoing
Now, I HOPE you've used a tempo guide. If you have, this will be a lot more painless than it needs to be. If not- a lot more of these will come out potato. 
We'll be using OREMO's VCV oto generator, so open up OREMO, change the directory folder to your voicebank.

Now, go to Generate oto.ini -> Kind of Utterance -> VCV



You'll then get this kind of window:



In the recording tempo, place the tempo you recorded at.
Now open up one of your samples. They should ALL begin at the same-ish time. Note down the start of this and place it in the box labelled "Utterance Start" and make sure you're in the right units! (800ms = 0.8 in the oto editing)

You can click "Initialise Parameters according to Recording Tempo" if you want, but it won't be perfect.
The guide for the numbers for each section is in this range:


Then click Generate Params (after you've made sure the boxes afterwards are ticked like this or something) 
And hope for the best. (If an error occurs, untick the Parameter Auto Correction 2, and if it still occurs, untick the Parameter Auto Correction 1)

Then BAM you will be prompted to save your oto.ini file somewhere (put it in your Voicebank folder) and now, comes the moment of truth.
Open a UST and run your voicebank through it.
If it sounds choppy anywhere, click on that note and while it is highlighted, open the Voice Configurations.
Now click Launch Editor


Sample of the first sound


Sample of every sound after the first

Here is the sample to reference. Note the following characteristics:
- Offset (blue) - cuts off all but the end 1/3rd of the first sound. (If you are checking the first sound, leave 1/3rd of a blank space)
- Consonant (pink) - Covers 1/3rd of the second sound. 
- Preutterance (red line) in between the end of the first sound and the beginning of the second sound (It's not perfect, but close enough. Considering that there are 1000+ samples to take care of, not all of them will be exact. It is up to your judgement as to what is 'close enough', but I think this is appropriate for the 'limit)
- Overlap (green) to halfway through the pink area of the first sound
- Cutoff (blue) - cuts off the end 1/3rd of the second sound. 

This applies to all sounds.
If any of these are out, then it will ruin the rest of that one sample, and you'll have to manually fix it (using the guidelines I provided above)
If a LOT of them are out, then you should change the numbers in the oto.ini generator. If the red is too far forward, reduce the number in the preutterance etc. 

Help! I can't see where the consonant ends and the vowel begins.Have no fear! Amazing spectrum generator is here!


Clicking this circled button will make this happen to your box...

If yours comes up a bit faded and difficult to see the patterns, click the asterisk box that has now appeared next to the 's' box you clicked before
Then drag the slider up. This will improve the contrast.
Now, can you see the changes in patterns?
Vowels are usually several parallel white/slightly blue lines in the centre.
Consonants are usually a little more fragmented, like a cloud of light/dark blue pixels.

Eventually after a lot of editing around, run through USTs, you'll get your perfect bank.
CONGRATULATIONS, YOU'VE NOW OTO'D YOUR VCV VB AND IT IS NOW READY TO SING! Hope to see some lovely covers from you soon! -v-)/

4. Final notesCVVC otoing - ch.nicovideo.jp/delta_kimigata… (Credits to the original writer for this)
To make the rendering of the samples faster:




Right click anywhere on the table (highlighted here by the big red box). Go down and click select (It's the bottom most option that has (M) ). Then, right click it again, and then click select all (New bottom most option that has (A) or something along those lines). This will make all the tables be highlighted in blue. Once you have gotten to this stage, click the smaller circled red box here, called Initialize freq. This will render all the frequency files so that they load faster when you play a UST. 

Thanks for reading, hope to see your UTAU up and about!

EDIT: 7/06/2021 While I appreciate the comments and would love to assist, I don't really dabble in UTAU anymore, so I wouldn't be the best person to be asking.

Related content
Comments: 70

ForgetMeAgain [2017-10-03 05:34:39 +0000 UTC]

Any advice if oremo refuses to execute oto ini command? After unchecking everything in VCV, well yeah it comes out blank. Darn. By chance do you know of anywhere of edit me otos?

👍: 0 ⏩: 3

ForgetMeAgain In reply to ForgetMeAgain [2017-11-10 13:24:54 +0000 UTC]

Sorry for the late reply and thank you for your reply Cabbagesaurus. Yes, unchecking all boxes wouldn't produce a result, it was a test.
In my case it was an execution error with script confusing  the parameters(likely just decompressed not accurate). I just re downloaded it on another computer, then to the original and it was fine for that.

MidnaMMD Did you record with a tempo? And it is vcv or cv? This is is important for you to "reinitialize" the settings in the generate oto box.
You will fill in your data and set/"apply" it. Than select the option to generate it and the folder it goes ,too

👍: 0 ⏩: 0

MidnaMMD In reply to ForgetMeAgain [2017-11-06 21:49:37 +0000 UTC]

Mine does the same thing, it just makes a blank oto file

👍: 0 ⏩: 0

Cabbagesaurus In reply to ForgetMeAgain [2017-10-03 13:24:38 +0000 UTC]

when you untick both boxes all the parameters go blank
you'll have to reinitialize them (theres a button for it)

otherwise send me a screenshot of your oto generator box

👍: 0 ⏩: 0

AkiCinnaBun [2017-09-26 20:12:38 +0000 UTC]

Thank you so much for this tutorial!! Thanks to you I have my first voicebank finished and ready to go~!

👍: 0 ⏩: 1

Cabbagesaurus In reply to AkiCinnaBun [2017-09-27 11:58:31 +0000 UTC]

glad it was useful! got any covers to show? :3c

👍: 0 ⏩: 1

AkiCinnaBun In reply to Cabbagesaurus [2017-09-27 20:55:39 +0000 UTC]

holy crud i got a little too excited about that question
I only have one at the moment since I just recently finished her bank  www.youtube.com/watch?v=v6ppw6…

👍: 0 ⏩: 2

pkunJP In reply to AkiCinnaBun [2018-06-26 05:28:51 +0000 UTC]

>///<

👍: 0 ⏩: 0

Cabbagesaurus In reply to AkiCinnaBun [2017-09-28 04:53:39 +0000 UTC]

! it came out pretty good!! greatest of jobs!

👍: 0 ⏩: 1

AkiCinnaBun In reply to Cabbagesaurus [2017-09-28 09:40:38 +0000 UTC]

Thank you!!

👍: 0 ⏩: 0

SparkyPsychc In reply to ??? [2017-07-05 09:16:43 +0000 UTC]

OHMYGOD~ Thanks! This is VERY helpful! Now I'll be making my first Voicebank came to life(?)!

👍: 0 ⏩: 1

Cabbagesaurus In reply to SparkyPsychc [2017-07-05 10:08:51 +0000 UTC]

glad it was helpful! i'd love to hear it when you're done

👍: 0 ⏩: 0

ShiroPiko [2017-04-09 17:23:18 +0000 UTC]

This is the only oto tutorial that had helped me. So simple and easy to understand too.

Thank you. TTwTT

👍: 0 ⏩: 1

Cabbagesaurus In reply to ShiroPiko [2017-04-09 23:56:54 +0000 UTC]

EH really!? i'm glad it's helped you!! >w<)/

👍: 0 ⏩: 0

BishieSan In reply to ??? [2016-08-21 07:08:33 +0000 UTC]

Bless this tutorial

👍: 0 ⏩: 1

Cabbagesaurus In reply to BishieSan [2016-08-21 11:44:09 +0000 UTC]

i forgot aku was going to make an utau
when when when?

👍: 0 ⏩: 1

BishieSan In reply to Cabbagesaurus [2016-08-22 04:14:37 +0000 UTC]

Once I get my mic ah <- still saving up for one

👍: 0 ⏩: 0

shobon-dama [2016-08-21 00:31:50 +0000 UTC]

nicely written fam

👍: 0 ⏩: 1

Cabbagesaurus In reply to shobon-dama [2016-08-21 01:42:22 +0000 UTC]

thanks bruh

👍: 0 ⏩: 0


<= Prev |