shimmerloid-ai - Tumblr blog

shimmerloid-ai · 2 months

Text

Preliminary Considerations - Which Vocal Synthesizer Software is Right for You? - Free Softwares

Although this blog puts a major emphasis on the VOCALOID 4 editor, it, or VOCALOID in general, is not the only vocal synthesizer that exists. There are tons of other software that have the same function and a variety of different voicebanks, with some being cheaper and of higher quality than VOCALOID, or even free! That’s right, there are quite a few free vocal synthesizers out there (however the lack of frills may come at the expense of some missing features or difficulty of usage), which I recommend trying out before pouring your hard-earned savings on a program that you may not even use. What if you learn that you do not enjoy tuning or do not have the time to use the software? It would be a huge waste of money that could be invested in other stuff, such as basic necessities (GOOD FOOD) or other leisurely items, like video games, clothing from your favourite bands, art supplies, or merchandise. In addition, there has been a rise of a lot of smaller companies coming out with vocal synthesizers with incredible UIs that not only look appealing but are easy to navigate, and voicebanks that sound far too human and advanced than hATsUnE mIKU (don’t worry, I love Miku with all my heart, I am just trying to prove a point here). There are also some really sick features that you may not find in the franchises with bigger names.

In this post, I will be describing the features of different free vocal synthesizers and their advantages and disadvantages so you can find the one that meets your vocalo-p needs. Please note, I do not own all of these synthesizers, some of these are from reviews on Reddit and VocaVerse Network. In addition, some cons like lag could just be a me problem and better computers may not experience such issues. Also , I will not be covering every single singing synthesizer in existence, just the well known ones and those with proper UIs because there are so many. I am omitting NEUTRINO because it does not have a UI despite having such high-quality vocals, along with ALTER/EGO, as it does not have a piano roll.

UTAU

(Song: Meltdown by iroha(sasaki); UST: Tanjiro Taidana)

UTAU was designed to be the free sister software to VOCALOID. Not only can you use it without spending a cent, but it allows you to make your own voicebank as well! There are tons of popular voicebanks out there, including the Vipperloids, Gahata Meiji, Kohaku Merry, Matsudappoiyo, Denatsu Sora, Shuu Mawaine, and my personal favourite, SUZU.

Pros:

Almost every single voicebank is free to download

Different types of voicebanks (CV, VCV, CCVC; Monopitch vs. Multipitch; Power, Weak, Soft, Whisper, Growl, Screamo; tons of languages)

Can make your own voicebank right in the software

Pitch bending on the piano roll instead of a parameter box!

Variety of job plugins to make usage easier

Credited for its growl and vibrato handling

Cons:

EXTREMELY dated, UTAU has not been updated since 2013

Not friendly for beginners, especially due to its old UI

Need to change your system's locale, and installing voicebanks can be frustrating

Most voicebanks (namely Japanese) can only read Hiragana phonemes and not Romaji ones; but job plug-ins can fix this issue

youtube

Open Utau

(Song: The Lost One's Weeping by neru; UST: Tanjiro Taidana)

If UTAU is the sister software to VOCALOID, then Open Utau is the younger sibling to normal UTAU. Open Utau is an open-sourced vocal synthesizer on GitHub with every feature in the original software while being easier to use.

Pros:

Dark mode with a sleek, easy-to-navigate UI!

Pitchbend with a click of a button; piano roll tuning is still consistent

Splice tool; useful for note-bending

No need to switch locale to Japanese

Easier to get the hang of

Frequent updates

Can use VSQXs and svps. without needing to convert them into USTs

Cons:

No Defoko…

Choppier and buggier than classic UTAU

Slow with rendering wav. files and launching the software

Phonemizers are tricky to work with, you don’t always get the same output as the same phonemizers in normal UTAU

External resamplers can cause overheating and slow down the software

Tuning is more dependent on job plugins than the original UTAU

Many users claim that otoing is easier in classic UTAU

youtube

SynthesizerV Studio Editor R1

(Song: Tengaku by Yuuyu; VSQx by Adam Edmond)

This preliminary edition of SynthesizerV was a major breakthrough for the vocal synth community when it was first released. With its realistic-sounding voicebanks and minimalistic aesthetic, this software has changed the game by a landslide for synth users. Although it's quite limited, R1 was an amazing start for what will become a godly program in the future. Pros:

Pitch bending on the piano roll and in the parameter box (very smooth, I experienced no lag when using it nor did I have to make pitch points or pause while editing the parameters)!

Voicebanks sound quite human

MIND BLOWING GLOTTAL EFFECTS (nine different growls, two screams, and a vocal fry that do not sound robotic!)!

Really simple UI, easy to pick up, great for beginners!

Cons:

Outdated; is no longer being updated by Dreamtonics

Needs a recording license for commercial use (though I highly doubt it is still being upheld)

Only four voicebanks are available; Eleanor Forte, Yamine Renri, GENBU, and AiKO - who is paid and an outdated version of her R2 voicebank (R2 versions of the same voicebanks sound much cleaner and realistic)

A little too minimalistic; aside from the addition of glottal effects and the typical pitch deviation, loudness, tension, breathiness, voicing, gender, and vibrato parameters, there is not all that much you can do in this edition of the editor

youtube

SynthesizerV Studio Basic

(Song: Antibeat by Deco* 27; UST: Mayu Sama Desu)

Also known as SynthV R2, this is the free edition of the software that is currently being updated, despite having fewer features than its complete, paid version.

Pros:

Ready to play with as soon as it is installed

Twenty-five free voicebanks; sixteen Japanese, seven English, and two Chinese; all with unique sounds

AI voicebanks!

Instant mode; allows you to automatically tune the pitch of an entire track with the press of a button, although it may make the voice sound too pitchy

Waveform that allows you to see the volume and pronunciation of certain notes

Can use paid voicebanks in the free editor!

Just as easy to figure out as SynthV R1!

Cons:

Can only have a maximum of three vocal tracks in a single svp. file

Pitch bending is a lot more finicky compared to SynthV R1

Lite voicebanks sound mono-pitch

Lacks a ton of features that are available in SynthesizerV Pro; scripts, auto-pitch tuning, rap vocals, cross-lingual synthesis, vocal modes, alternate phoneme choices, and many other features are not included in the basic edition (even paid voicebanks can not use cross-lingual synthesis, vocal modes, etc)

The glottal effects parameter that was in SynthV R1 is sadly not included in both the Basic and Pro editions of the current program

youtube

VoiSona

(Song: iNSaNiTY by Circus-P; VSQX: Cirty_09)

Previously named “CeVIO Pro”, VoiSona is a vocal synth that uses AI technology to create beautiful vocals with characters that originated from a variety of other vocal synthesizers (such as VOCALOID!) and are created with the recordings of talented singers and voice actors. CeVIO project has also launched a trial speech vocal synthesizer called “VoiSona Talk” for their first anniversary.

Pros:

Users get Chis-A’s full voicebank upon downloading the synthesizer!

The program itself is entirely free to download

AI technology makes tuning easier

Piano roll pitch-bending

Has some features that are missing in its sister software, CeVIO AI

The “husky” parameter is great for making whispers

Can be used as a VST plugin in most DAWs or a standalone editor

Cons:

All other voicebanks are paid; either you purchase the entire voicebank once, or get a subscription to use all of them

HEAVY LAG; the program is quite slow with processing commands

Free-hand pitch-bending is not as easy to perform compared to UTAU or SynthesizerV; can be quite sensitve and the AI may not always yield the desired result

youtube

DeepVocal

(Song: New Darling by MARETU; UST: Mimisan15)

The successor to the Sharpkey Galaxy software, this vocal synthesizer was designed for Chinese voicebanks. Its UI is a combination of VOCALOID4 and UTAU, giving it a sense of comfort and familiarity. Speaking of which, you can create your own voicebank in DeepVocal as you can in UTAU and OpenUtau, and there are some pre-made voicebanks of popular UTAUs, including Namine Ritsu, Inari Akane, and Kuro Bousuka. In addition, there is also a Kiana voicebank commissioned by MiHOYO and based on the protagonist of Gun Girls Z and Honkai Impact 3rd!

Pros:

Ready to use as soon as its out of the box

Great engine for Chinese voicebanks

Can create your own voicebank

Runs smoothly

Has all of the necessary parameters needed to create songs and covers

Cons:

Voicebanks can be kind of shaky, choppy, and more sensitive to pitch changes compared to other engines like UTAU and VOCALOID

Pitch bending can be quite clunky

Voicebanks may have difficulty reading certain phenomes from converted USTs; you may need to edit them if you don’t want lyrics being read as “a” or “ra”

youtube

These were all of the major free softwares I found, but if I come across another vocal synthesizer in the future, even if it is not talked about in the vocal synth community much, I may make a post about it.

I know there are a ton of cons I found for much of the vocal synths on this list and they sound like nitpicks on my part, but as I stated at the start of this post, some of these issues could be a Shimmer Thing™ and they may not arise for you when using these softwares. I won't be surprised if you read through this post and are now feeling thrown off by the various features and pros and cons of these programs, so here's my two cents on what I think beginners should go for:

If you like realistic voicebanks and want a very simple software to start with, get either SynthesizerV Editot (R1) or SynthesizerV Studio Basic (R2). If you would like to experience them (spicy) glottal effects and very kind pitchbending (like it does not make you want to bash your head against the wall because Renri won't cooperate) along with unlimited vocal tracks, then try out R1, and if you want more features, voicebanks, and continous updates, go for R2. Or even better, try out both and decide which one suits your interests better.

If you have a preference for robotic voicebanks, would like a variety of vocals to play with, and find plug-ins interesting, then UTAU may be for you, especially if you want VOCALOID but you can not afford it at the moment. Although I shitted more on Open Utau than I did on regular UTAU, I recommend the former over the latter as it is still being updated and the UI is signifcantly easier to navigate, along with its phenomenal pitchbending function.

Finally, please take my words with a grain of salt. If you like the voicebanks or are interested in a specific software, or discover one that is even better than any of the listed vocal synthesizers, by all means, go for it! This is just a surface guide by an idiot who spends most of their time trying to make Fukase not sound like a computer dying, and I have not used any of these softwares as much as I have messed with VOCALOID. Plus, my computer is an absolute bitch, so you guys will probably have much better luck than me.

I hope this guide was of use and provided a better insight on the various engines out there. My next post will compare different paid vocal synthesizers, including CeVIO AI, Piapro Studio, and of course, the various VOCALOID softwares. Don't worry, I'll get to the actual tutorial bit very soon.

Also, feel free to ask any questions about vocal synthesizers, or... literally anything! I'm practically starved for asks-

Thanks for reading!

#long post #vocal synth #vocalop #utau #synthesizer v #vsynth #open utau #synthv #deepvocal #voisona #chis a #kasane teto #eleanor forte #tanjiro taidana #vocal synthesizers #vocaloid #vocaloid resource #i know this is long im sorry for wasting your time

7 notes · View notes

shimmerloid-ai · 2 months

Note

CANON CUS I SAID SO

Opinions on my fukalen au..?

Um so all the vocaloids live in a little cottagecore and cozy town and fukase owns the loc tavern and Len comes in every day, not to drink (Len can barely stomach alcohol) but to chat to fuka since he secretly has a massive crush on him ( I say secretly but in reality everyone knows Len has a massive crush on him) and fukase let's him stay after closing time and makes him non-alcoholic drinks

They're my queers

Ahhh I love them!! I’m sorry for answering this so late, I wanted to draw something for it I kept forgetting,,

But this is so real and true I believe in lightweight Len who sticks around just cuz he has a super obvious crush- they’re our queers now!

#vocaloid #vocal synth #kagamine len #len kagamine #fukase #vocaloid fukase #fukase vocaloid #len x fukase #lenkase #fukalen #fukase x len #cottagecore #lens a fuckin lightweight #i love gay people

45 notes · View notes

shimmerloid-ai · 2 months

Note

(hi yu) not a vocaloid ask, but im here to say if you dont follow my friend here, i will eat everyones toes. so unless you want to be toe-less, follow @/shimmerloid-ai :)

hey bestie!

also yeah, ya'll better follow if you want to keep your toes intact >:D

#ask

0 notes

shimmerloid-ai · 2 months

Note

Really useful guide for pitchbending in VOCALOID4 and VOCALOID5 as someone who doesn't pitchbend or uses V5.

id love some tuning advice... im so bad at it JEJDNDNMF

its all in the pitchbends, friend!

but first off SET AND SYNC YOUR FUCKING BPM CORRECT you need an external program to sync you bpm to your off-vocal file please view this video for more info (this is for utau but utau is just Vocaloid for poor people who love diy. i respect them.). do not do ANYTHING until this is set up or your life will suck.

i personally dont do any of my vsqs (or rather vprs because i prefer working with v5) starting in an external program, i only use them to sync up. but thats because i use vocaloid 5. vocaloid 5 (or at least the newest version) runs fast and its easier to move around notes and change things while the song is still playing, which v4 can NOT! so if you only have v4 and earlier versions, map out your stuff in an external program like in the video. fl studio works.

this is important, make your lengths large, as most songs don’t require 1/64th of a beat. use this to map out your base pitches.

like so

and you end up with this

super fucking boring. dont stop here. im begging you. dont stop here. nobody irl sings like this.

you need to mess around w/ singer parameters yes but you need to tune through pitchbends more importantly!

lets take this for example, its a simple melody in C Major:

Keep reading

#vocaloid #vocaloid tutorial #vocaloid resource #vocaloid4 #vocaloid5 #pitch bending

64 notes · View notes

shimmerloid-ai · 2 months

Text

WHAT BOOK IS THIS?!?!?

Rereading the books and huh??

3K notes · View notes

shimmerloid-ai · 2 months

Text

Introduction - Vocal Synth Terminology - Part 2

This is a continuation of my previous post. I highly recommend checking that one out first before reading this post.

Append: This term was originally used to describe an additional voice library, originating with Hatsune Miku Append (shown below) and Kagamine Rin and Len Append. Appends provide vocal synths with different tones, such as Miku’s Soft append giving her a more gentle, breathy voice, and her Dark append, which provides her with a more mature, motherly tone. Now, the word “append” can be used for any vocal synth of a character that differs from their usual voice, such as Gumi’s “Adult” voicebank, though the correct word to address these voicebanks is just… voicebank. Proper appends are commonly seen among UTAU voicebanks, like Kasane Teto’s Weak Append, and Yamine Renri’s EDGE Append!

Fanloid: Also known as "derivatives", these are fanmade vocal synth characters. Fanloids do not have their own voicebanks, instead, their voices are made by editing the parameters of a pre-existing vocal synth. For example, Akita Neru (shown below) is a high-pitched Miku or a low-pitched Rin, Yowane Haku is a low-pitched Miku, Hatsune Mikuo is literally a gender-bend of Miku with the voicebank’s gender factor increased, and Honne Dell made with Kagamine Len with his gender and brightness factors lowered.

Realistic Voice Cloning: Also known as RVC, this phenomenon is loathed by every vocal synth user. If you have been on the internet since 2023ish, you have probably seen things like “Mr. Beast AI sings IDOL by YOASOBI” (shown below), or “Dio Brando sings Colleen Ballinger’s Apology”. These AI modules basically involve taking voice samples of a celebrity, fictional character, or literally ANYTHING, and creating a voice module that can sing any audio track provided to it. It should not be a big surprise that AI has made its way to the music industry as well, but as cool as they sound, most RVC modules are illegal and essentially harmful to the vocal synth community, and the music industry in general. People make these modules without the knowledge of the “voice providers” and gain a ton of views for doing LITERALLY NOTHING aside from mixing the vocal track with the instrumental, and often make money off of someone else’s voice. They could also make the module say something obscure, offensive, or lewd, and put the blame on the voice provider. I do not think we need to go into details about what is wrong with these consequences, but long story short, they fucking suck. The icing on the cake is that people would make AI modules of VOCAL SYNTHS as well, which is why most people think Miku is like Squidward AI or whatever. Not only does it harm the voice providers who put their time and effort into creating amazing voicebanks, but it also gives the public a nasty impression of the producer and cover artist community.

youtube

Jinriki UTAUs: Similar to the RVC phenomena, these are UTAU voicebanks that are made without the knowledge of the voice providers. Again, Jinrikis can be anything, from Mario to Baldi from Baldi’s Basics in Education and Learning, to a YouTuber or a LoveLive! character. Yeah… they can be quite cursed. Like RVC, these voicebanks can be considered illegal, however, it's not much of a problem if you keep these voicebanks to yourself, don’t run around advertising them, distribute them, or use them for commercial use. Jinriki voicebanks should not be confused with porting voicebanks (second video below) from other software, like putting MEIKO into Open UTAU. This is completely okay, so as long as you do not distribute them and port the voicebank yourself.

youtube

AI Voicebanks: These are actual paid voicebanks that use AI technology to enhance the quality of the vocals to make the tuning sound realistic and to make the overall tuning process much easier for producers. The AI technology would analyze the pitch deviation and other factors of a file and smooth out and clean up anything that sounds too flat or janky. In fact, even when a file is untuned are flat, AI voicebanks will try to create a smooth transition and can provide users with a nice start. As useful as they sound, sometimes these voicebanks can be unpredictable and not provide you with the desired output, but with a bit of practice you can create beautiful melodies with them. The popularity of AI voicebanks started with CeVIO and SynthesizerV, but now we have AI voicebanks in VOCALOID as well! However, the V6 voicebanks are… not exactly the best, but I will get into them in another post. The most popular AI voicebanks as of now are Chris-A (CeVIO + VOISONA), Eleanor Forte (SynthesizerV), KAFU (CeVIO + SynthesizerV), ONE (Synthesizer V), Megpoid Gumi AI (VOCALOID + SynthesizerV Studio), and recently, Kasane teto AI (SynthesizerV; shown below)!

youtube

Diffusion Singing Voice Conversion Model: Commonly known as Diff-SVC, this a program used to make AI voicebanks that works similarly to UTAU; you make a voicebank with your own singing, and use AI to make it into a voicebank that sounds human. These voicebanks are slowly “trained” to sound more realistic, and can produce beautiful vocals! These should not be mixed up with RVC modules as these voicebanks are either made by the voice providers or with their consent, and they have to be tuned like in any other synthesizer software.

youtube

Talkloid: These are commonly seen in memeish side of the vocal synth community. Basically, instead of making voicebanks sing, you tune them to make them talk. Talkoids are typically shitposts of vocal synths in bizarre scenarios (with a ton of swearing and Generation Z/Alpha jokes… some talkloids are really unhinged and/or cursed), but there are a few hidden gems such as cute conversations or stories tying to a producer’s lore about their favourite vocal synths.

youtube

PV: This acronym stands for Promotional Video. PVs accompany vocal synth songs and covers to enhance the listening experience . PVs can range from still artwork with lyrics underneath, to animatics and full-on animations.

youtube

Project Diva: A rhythm game series starring the Cryptonloids alongside Kasane Teto, Yowane Haku, and Akita Neru created by SEGA. Not only are they are known for their nightmarish beatmaps (looking at you, The Intense Singing of Hatsune Miku), but their stunning 3D PVs as well. Producers would often record PVs of the vocal synth of their choice in whatever costume they desire, and if the singer is a character who is not in the game, there are tons of mods on GameBanana (example with Otomachi Una shown below; the Teto AI Ghost Rule cover I posted above is another instance of this)!

youtube

Project Sekai! Colorful Stage!: Hell. Absolute hell. Just kidding, I have a strong feeling that you already know what this is, but in case you do not, Project Sekai! is a mobile rhythm game created by Sega and Piapro starring the Cryptonloids and five music groups of original characters, all with their own stories and songs. Like any other mobile rhythm game, you can pull for characters you want with gacha currency and participate in events that may or may not relate to the main story for prizes. A lot of new VOCALOID fans come from the Project Sekai! fandom, and although much of the fandom is a mess of entitled, bratty teenagers on Twitter, some people genuinely want a better understanding of how vocal synths work, softwares aside from VOCALOID, and songs aside from those in the game. This is one of the reasons why I made this blog, to properly explain how vocal synths work, and I hope this will give you a better understanding of this precious community. Also… I may add my EN ID to my about page. Maybe.

MikuMikuDance: Commonly known as “MMD”, this is a community-driven animation software. In MMD, you can create your own motions, cameras, stages, accessories and models, to create stunning PVs. You can also use other software like Blender to enhance your animations As the name implies, MMD was initially designed for VOCALOID and UTAU fans to create music videos of their favourite characters, but now it has evolved beyond great lengths. You are sure to find models and stages from your favourite video game or TV/anime series, and there are tons of motions and cameras out there. If you were in the vocal synth fandom in the late 2000s’, you may remember the MMD CUP series. These animations are an example of common memes people will make with this software, alongside PVs for Talkloids or a short motion of your favourite character doing whatever dance is currently popular on TikTok.

youtube

Digital Audio Workstation: Know by the shorthand “DAW”, these are softwares used for making music! DAWs are extremely versatile, allowing you to create a huge variety of projects in them. You can mix an exported wav. file of a tuned VSQx with an off-vocal track, edit audio in general, or use the virtual instruments to create your own arrangements and arrangements! In the vocal synth community, people do all of these things! The most popular DAWs include FL Studio (shown below), Studio One, Logic Pro, Cubase, Ableton Live, Reaper, Cakewalk, and Waveform! Unsurprisingly, DAWs are quite expensive, usually costing more than the actual vocal synthesizer as well. Not to mention, plug-ins can add additional expenses. When picking a DAW, it’s honestly better to pick quality over cost, all while seeing what fits your budget and needs. Oh, I’m going to regret exposing myself here, but in case anyone asks, I have no idea how to mix and probably will not make a tutorial on it. I have never used a proper DAW aside from Audacity and Soundtrap by Spotify, and those mixes turned out HORRENDOUS. Not to mention, I’m practically broke so I can’t invest in a good DAW. I’m trying to figure out how to use FL Studio 20 (the free, limited version of this software), but my smooth brain can not wrap my head around the process, even with two helpful tutorials I found. Plus, I do not have the time to learn. However, I may share some resources in the future relating to using a DAW from people with a brain cell, so keep your eye out for those. But I promise, the day I get decent experience in mixing is when I will make a guide on it.

That is all of the definitions I can think of for now! I know I did not go over all of the parameters like "growl" and "cross synthesis", and that is because I will explain how all of the parameters work in a planned post. I also took some notes from Minnemi's video on basic VOCALOID terminology and the Vocaloid wiki. If I am able to think of any other important jargon later on, I will update this post. Thank you for reading this! I hope your knowledge on vocal synths has expanded, and I apologize for the huge post!

#vocaloid tutorial #vocaloid #vocal synth #vocaloidproducer #ai that are not paid voicebanks must die #mmdmikumikudance #mikumikudance #project diva #fanloid #synthesizer v #utau #long post #vocaloid resource

3 notes · View notes

shimmerloid-ai · 2 months

Text

Introduction - Vocal Synth Terminology - Part 1

This post will be split into multiple parts due to Tumblr's character limit.

If you are new to the Vocal Synth community, you may encounter some words and phrases you don’t understand. For instance, someone may tell you about Rin and Len’s appends, and you may confuse that term for the difficulty in Project Sekai! Colorful Stage! Or may have heard someone discussing USTs, but can not find its definition anywhere nor figure out what the hell they are talking about.

Well, I made a dictionary of sorts to help newbie fans get used to Vocal Synth jargon. The keyword is “Vocal Synth” as these apply to other software as well. These definitions have a greater focus on the programs themselves than the characters themselves.

Credits to Vocaloid Wiki and Minnemi on YouTube for some of these definitions.

Vocal Synthesizer: A digital instrument that creates tracks like any other DAW, but instead of piano notes, guitar strums, or drum beats, you compose vocals! Also known as “vocal synths”. Examples of vocal synthesizers include VOCALOID, UTAU, SynthesizerV, CeVIO, and Piapro Studio.

Voicebank: A collection of recordings of the sounds that make up a language. These sounds are typically vowels and constants, but depending on the voice bank, you may also get breath notes and pronunciation effects. Or, in simpler terms, the singers that are used in vocal synths! There are ton of voicebanks in the vocal synth community, with some of the popular ones being Hatsune Miku (VOCALOID + Piapro Studio), Kagamine Rin and Len (VOCALOID + Piapro Studio), Megurine Luka (VOCALOID + Piapro Studio), Kasane Teto (UTAU + SynthesizerV), Megpoid Gumi (VOCALOID + SynthesizerV + A.I. VOICE, FineSpeech Ver3), flower (VOCALOID + Gynoid Talk + CeVIO), IA (VOCALOID + CeVIO), and KAFU (CeVIO + SynthesizerV)! Individual vocal synth characters can also have different versions of their voice, such as Yuzuki Yukari’s Onn (soft) and Lin (power) voicebanks!

Voice Provider: The person whose voice that a voicebank is created. Voice providers record samples of their voice (specifically vowels and constants) at a certain key (for instance A3), which are turned into a voicebank with the company’s black magic (I’m kidding, I don’t know how they process and put the vocals together). For instance, PIKO is Utatane Piko’s voice provider, Satoshi Fukase is Fukase’s voice provider, and Naoto Fuga (shown below) is KAITO’s voice provider!

Crypton Future Media: The brains behind some of the most popular VOCALOIDs, which are Hatsune Miku, Kagamine Rin, Kagamine Len, Megurine Luka, KAITO, and MEIKO. Aside from voicebanks, they created games, concerts, merchandise, and much more relating to these beloved VOCALOIDS! Cryptonloids are… VOCALOIDS created by Crypton. Soon, Crypton departed from Yamaha and made its own vocal synthesizer in affiliation with another company called Piapro named Piapro Studio. There are two versions of this software; Piapro Studio NT and Piapro Studio V4x.

UTAU: A vocal synthesizer that is considered the “sister” software to VOCALOID. Unlike VOCALOID, this software is 100% free and you can create your own voicebank. There are thousands of UTAUloids at this point in time, giving you a huge selection of different ranges and strengths. Popular UTAUloids include Utatane “Defoko” Uta, Kasane Teto, Namine Ritsu, Momo Momone, Yowane Ruko, Sukone Tei, Rook, Gahata Meiji (shown below), Yamane Renri, Matsudappoiyo, Keine Ron, Kohaku Merry, Gekiyaku, Kazehiki, Adachi Rei, Ooka Mika, and so many others! There is also an open-source version of UTAU called Open UTAU, which is much easier to install and use (it has a dark mode!). Vipperloids are the classic UTAUloids that share surnames ending with “-ne” and their VOCALOIDish designs. These include Utatane “Defoko” Uta, Kasane Teto, Namine Ritsu, Momo Momone, Yowane Ruko, Sukone Tei, and many others.

SynthesizerV Studio: Also known as SynthV, this is a vocal synthesizer made by Dreamtonics that is well-known for its AI voicebanks. For a software that is smaller than VOCALOID, they are extremely advanced with realistic-sounding voicebanks, piano-roll tuning, rap vocals, and so many other features. It’s also much cheaper (thank you, Yamaha money sharks). In addition, Dreamtonics has two free versions; SynthesizerV Studio R1, and SynthesizerV Studio Basic R2. Popular SynthV voicebanks include Eleanor Forte, Kaorou Rikka, GENBU, Tsurumaki Maki, SAKI, SOLARIA, KEVIN (fan design by ivylare shown below), Stardust, ROSE, POPPY, and Kasane Teto Ai!

CeVIO Project: A collection of voice synthesizers created in collaboration with five different companies including Techno Speech and Frontier Works. Not only do they make vocal synthesizers, but their softwares have speech interfaces as well. As of now, their most popular program is CeVIO AI, a next-generation vocal synthesizer that uses AI technology to create powerful vocals as seen in SynthesizerV. Popular voicebanks include Chis-A (shown below), KAFU, Sato Sasara, IA AI, ONE, Yuzuki Yukari Rei, CiFlower, POPPY, ROSE, and many others.

Tuning: Essentially how you want a song or cover to sound. By editing the parameters of the individual notes and that of the voicebank itself (including the pitch, volume, strength, sharpness, and breaths), you can obtain an entirely different result of how the singer sings the encoded notes through different methods. This blog is dedicated to teaching people how to tune, so I’ll show a variety of tuning styles in the software.

V_: The VOCALOID software edition. As of now, there are six editions of the software, which are VOCALOID, VOCALOID2, VOCALOID3, VOCALOID4, VOCALOID5, and VOCALOID6. A lot of VOCALOID voicebanks would be named after the edition they were designed for, such as Gackpoid V4.

VSQ/VSQx/VPR/UST/SVP: The different vocal file formats through which the note, lyric, and tuning data are saved in different vocal synthesizers. These files are not exactly specific to a single editor as they can be converted to the appropriate formats:

VSQ: VOCALOID2 and VOCALOID3

VSQx: VOCALOID4

VPR: VOCALOID5 and VOCALOID6

UST: UTAU and OPENUTAU

SVP: SynthesizerV Studio

Phonemes: In linguistics and developmental psychology, phenomes are the smallest sounds of speech that distinguish one word from another. Similarly, in vocal synths, these are the building blocks of the individual lyrics that are read by the voicebank. Phonemes differ from the lyrics in a vocal synth file as the lyrics are the actual syllables in language while the phonemes are based on the X-SAMPA system. For instance, let’s examine and compare lyrics from “The Lost One’s Weeping” by neru to the phonemes that would be written in a vocal synth. Romaji lyrics (Source - Vocaloid Lyric Wiki): kokuban no kono kanji ga yomemasu ka? Romanji lyrics in VOCALOID4: [ko] [ku] [ba] [n] [no] [ko] [no] [ka] [n] [ji] [ga] [yo] [me] [ma] [su [ka] Phonemes in a vocal synthesizer VOCALOID4: [k o] [k M] [b a] [n] [n o] [k o] [k a] [n] [dZ i] [g a] [j o] [me] [m a] [s M] [k a] As we can see here, the phonemes of a song can differ significantly from the lyrics that are entered into a program. You can also edit the phonemes of a lyric for better pronunciation (for instance, for the word “you’d”, you can try [y M d]), or split them up into vowels and constants in notebending. In addition, there are entirely different phonemes for voicebanks designed for different languages; for instance, VOCALOID has Japanese, English, Chinese, Korean, and Spanish voicebanks. However, it is possible to make voicebanks sing in different languages, like how Utsu-P makes Miku V4 English sing in fluent Japanese. There are also phonemes for breaths, and glottal stops, as well as pronunciation effects that are exclusive to some voicebanks, like Enhanced Voice Expression Control (E.V.E.C.) in the V4x Cryptonloids. I will go into greater depth on phonemes in a future post.

Pitch bending: The effect where one note slides to another in a clean fashion without sounding flat. When people usually mention pitch bending in a vocal synth, they are referring to the tuning style where you alter the pitch using the “pitch bend” and “pitch bend sensitivity” parameters. If you have seen tuning streams or covers where people show their editors, you may have noticed dynamic and sometimes dramatic lines either on top of the notes or in a box beneath the piano roll. These are pitch bends! By drawing pitch curves in different ways, you can acquire different ways the notes are sung. You can then increase or decrease the pitch bend sensitivity of certain notes to change the factor of how many semitones the pitch curves will jump or fall by when the pitch bend parameter is brought to the maximum or minimum values. To paint a better picture of this concept, I made a quick VSQx of the "watashi" ([w a] [t a] [S i]). The curves on cutting through the green box are my pitch bends, and the thin red line running through the notes is the result. The transparent box behind it is my pitch bend sensitivity, which I increased for more sensitive in the [w a] and [t a] notes, and decreased for less for the [S i] phoneme.

Note bending: A tuning style where you manipulate the pitch by splitting notes into smaller notes. You can move the notes up and down or edit the phonemes to obtain different effects in notes. If you would like to breakdown the phrase [w a] [t a] [S i], you can write the notes out as [w a] [a] [a] [a] [a] [t a] [a] [S i [i] [i]! This is my preferred method of tuning as I do not enjoy drawing lines and like the nostalgic effect of the clean, slightly robotic sounds.

Portamento Timing: This term can have multiple definitions, but the general meaning is a slide from one note to the next. Do not confuse this for pitch bending as the way that notes transition in portamento is different from the former. In Vocaloid, portamento is a parameter that allows you to alter the timing of the pitch. Increasing the value would result in the pitch being more delayed, and decreasing it will cause the pitch to be sung earlier. In UTAU and SynthesizerV, portamento refers to the editable points in a pitch curve. Adding more points allows you to have more freedom in creating pitch bends.

Pitchsnap Mode: A setting in vocal synthesizers that causes the pitch curves to “snap” from one note to another. This setting yields a more autotuney and robotic tone in tuning. While I prefer to tune with this feature shut off, I have heard that the pitchsnap function makes pitch-bending much easier. Remember our "The Lost One's Weeping" example? Here is an amazing cover of it by our lord and saviour Jade S. with Fukase and Miku V3 Solid that showcases how beautiful the pitchsnap function can make the vocals sound when used correctly!

youtube

Mixing: A process of blending vocals with an off-vocal or instrumental so the singing fits in the environment of the vocal's music. It's more than just plugging in an audio track, you need to ensure that the vocals are cleaned up, are at an appropriate volume, and do not sound out of place. People can get super creative with mixing by adding reverb, radio-like effects, growls, and “adlibs” during instrumental breaks! All in all, the mixing of vocals is just as important as the tuning.

Producer: Anyone who makes music using vocal synths. This title was initially reserved for people who make original songs but can be used to describe cover artists like myself as well. Popular producers include ryo(supercell), kzlivetune, wowaka(shown below; Rest in Peace), neru, Deco* 27, and many others!

“-P”: Standing for “producer title”, this suffix originated from the IDOLM@STER fandom and refers to anyone who makes music with vocal synths, or in other words, vocal synth producers! For instance, why do we call Circus-P by his name with the "-P" suffix? Because that is what he is, a producer! You can also use the title “vocalo-p” to address synth users.

#vocaloid4 #vocal synth #vocaloid #vocaloid tutorial #vocabulary #vocaloid jargon #long post #resource #dictionary #utau #utauloid #synthesizer v #synthv #cevio

22 notes · View notes

shimmerloid-ai · 2 months

Text

rin and len

40 notes · View notes

shimmerloid-ai · 2 months

Text

Defoko! 🍚💜

2K notes · View notes

shimmerloid-ai · 2 months

Text

ㅤㅤㅤㅤㅤㅤElectric angel

ㅤㅤㅤㅤㅤㅤㅤㅤㅤ.

Привет, я Old37, эта соцсеть - новая для меня. Я буду выставлять здесь свои работы. Выше - одна из моих работ. Мне нравится этот персонаж)

ㅤㅤㅤНадеюсь, мы подружимся)

4 notes · View notes

shimmerloid-ai · 2 months

Text

More sekai fukase shenanigans

144 notes · View notes

shimmerloid-ai · 2 months

Photo

HATSUNE MIKU IS THAT YOU???

29K notes · View notes

shimmerloid-ai · 2 months

Text

Introduction - What is VOCALOID?

Hello everyone, Shimmer here! This is my first post in this guide blog thingy. I thought it would be a good idea to explain what VOCALOID actually is before I jump into how to use the software. Otherwise, it would be like baking a cake without knowing what cakes are.

So, let’s start by addressing what VOCALOID is not.

VOCALOID is NOT an anime series. Although Hatsune Miku made cameos in "Dropkick on My Devil!", she never originated from an anime series because she is NOT an anime character.

Second, VOCALOIDs are not those crappy AI voice models. You know, those weird “voicebanks” where you can make Spongebob Squarepants sing "7 Rings" or have Cartman from South Park rap "INDUSTRY BABY"? Yeah, those are actually illegal renditions of celebrity voices without the knowledge of the voice actors/influencers/singers whose voices were used to make the models. You just put the models over an audio track, and boom. Lazy, illegal shit.

youtube

Finally, this is just common sense, but VOCALOID did not originate from Project Sekai! Colorful Stage! The Cryptonloids (Miku, Rin, Len, Luka, Kaito, and Meiko) have existed long before the game was released; VOCALOID 1 was released in 2004, while the money making machine was launched in Japan in 2020. That is a gap of sixteen years, and if you compare the time between Hatsune Miku V2's release and Project Sekai, we have another thirteen year difference there.

With that being said, what *is* VOCALOID?

The best definition I can give you is that it is a digital singing synthesizer. Basically, it is an instrument, but instead of piano notes, you get vocals.

youtube

And no, *this* AKITO is not associated with the Akito Shinonome from Project Sekai.

To advertise this voicebanks and increase their appeal, Crypton, VSINGER, AH-Software Co., Internet Co. Ltd, and many other companies that make voicebanks for this software have cute or hot anime-style avatars designed for their box art. This was a great marketing scheme in my opinion, because wouldn't you be more inclined to purchasing something if it looks aesthetic, kawaii, or epic? Just look at GUMI's design!

Alright, I have a feeling I may have bored most users who are reading this weird info-dump, so I am going to add one final, important point. Remember our wood analogy? Well, we have the workbench (VOCALOID), and the wood (the voicebank(s) of your choice). Making a desk for instance would be like making a cover of a song. But people can make the same kind of desk with an entirely different appearance or texture. Similarly, a lot of producers can make covers of the same song, but they can sound entirely different in regards to their pitch, tone, or melody. This aspect is known as "tuning".

Tuning is basically the process of editing the properties of a voicebank and the notes/lyrics they are singing to create a specific sound. People can tune the same song in different ways. For instance, listen to the original "Rolling Girl" by wowaka, and then these covers. They are all the same song, but tuned in entirely different ways.

Below is the original song:

youtube

And these are all covers:

youtube

Also yeah, that last cover is mine, it's my blog, I can promote my content if I want to)

I hope that just by listening to these you can see how tuning can vary from individual to individual. Its all a matter of how you control the parameters of the singer.

So yeah, I yapped enough so I'm gonna end this infodump right here. I'm not surprised if you guys are still confused, so I'm going to leave some helpful resources down below as these people are better at explaining shit than I am.

youtube

My next post will involve some common terminology used in the VOCALOID community, such as “VSQx”, or “pitchbending”.

Goodbye for now!

#vocaloidproducer #vocaloid #vocal synth #vocaloid4 #Vocaloid tutorial #Youtube

22 notes · View notes