This release fixes two intermittent crash bugs. They occur rarely and have been reported by only two users so far, so there's no immediate need to upgrade to 0.8.3 if the previous version works fine under your setup.
I've been quite intrigued by the design of audio devices ever since getting my ER4 earphones, which claim to reproduce the pressure field in the ear canal as it would be in reality.
The only thing that sounded fishy to me was how a soundcard with an output impedance of 40 Ohm could drive a 45 Ohm earphone without voltage division taking place. A search on Google turned up nothing besides audiophiles selling their impedance-matching snake oil. It didn't seem that many people had done a serious measurement on this matter, so I decided to do one, and also to measure what happens when an op-amp is used to boost the current delivery.
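To make the concern concrete, here is a minimal sketch of the voltage divider formed by the two impedances quoted above. It assumes both are purely resistive and constant over frequency, which a real earphone is not, so it only gives a rough idea of the level loss; the function names are made up for illustration.

```python
import math

def load_voltage_fraction(z_source_ohm: float, z_load_ohm: float) -> float:
    """Fraction of the open-circuit source voltage that appears across the load.

    Assumes purely resistive source and load (a simplification).
    """
    return z_load_ohm / (z_source_ohm + z_load_ohm)

def level_change_db(fraction: float) -> float:
    """Express a voltage ratio as a level change in decibels."""
    return 20.0 * math.log10(fraction)

# Figures from the text: 40 Ohm output impedance, 45 Ohm earphone.
fraction = load_voltage_fraction(40.0, 45.0)
print(f"{fraction:.3f} of the open-circuit voltage "
      f"({level_change_db(fraction):.2f} dB)")
```

Under these assumptions, roughly half of the open-circuit voltage reaches the earphone.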
The report can be downloaded from here.
Our users in Japan discovered long ago that Moresampler adds a vibe to the voice when the modulation parameter goes to 100%. (FYI: modulation restores the pitch fluctuation in the recording.) While this feature wasn't deliberately designed, it comes as a surprisingly nice byproduct of Moresampler's sophisticated algorithm and architecture. The mod 100% vibe isn't perfect yet, though: in some cases, just as with mod 100% on many other engines, the pitch goes completely off, making the voice sound almost "drunk". This release includes a fix for this problem, and I'd recommend that everyone give it a try.
Inspired by the modulation parameter, a new flag 'Mp' is added that randomly perturbs the pitch curve. The number after 'Mp' controls the degree of perturbation. While the range is from 0 to 100, a small number around 5 should be enough to notice the difference.
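As a rough illustration of what randomly perturbing a pitch curve can look like, here is a minimal sketch. It is not Moresampler's actual algorithm: the function name, the mapping from the 0 to 100 value to a depth in cents (`max_cents`), and the smoothing filter are all assumptions made for this example.

```python
import random

def perturb_pitch(pitch_hz, mp, max_cents=50.0, smoothing=0.9, seed=None):
    """Return a copy of pitch_hz with a slowly varying random offset applied.

    mp        -- degree of perturbation, 0 to 100 (as for the flag value)
    max_cents -- hypothetical depth in cents reached at mp=100
    smoothing -- hypothetical one-pole low-pass coefficient for the noise
    """
    if not 0 <= mp <= 100:
        raise ValueError("mp must be between 0 and 100")
    rng = random.Random(seed)
    depth_cents = max_cents * mp / 100.0
    drift = 0.0  # low-pass filtered noise, always within [-1, 1]
    perturbed = []
    for f0 in pitch_hz:
        drift = smoothing * drift + (1.0 - smoothing) * rng.uniform(-1.0, 1.0)
        # Apply the offset on a log-frequency (cents) scale.
        perturbed.append(f0 * 2.0 ** (depth_cents * drift / 1200.0))
    return perturbed

flat_curve = [220.0] * 100
print(perturb_pitch(flat_curve, mp=5, seed=1)[:5])
```

Working in cents rather than Hz keeps the perceived size of the perturbation the same for low and high notes, and frames with a pitch of 0 (unvoiced) are left at 0.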
An Arpasing-related bug in the oto generator is also fixed. Moresampler 0.8.2 is fully compatible with the recently updated Arpasing 0.2.
This is an update to the Arpasing standard (0.2.0), the recording script (0.2.0) and the Arpasing Assistant plugin (0.2).
Since the initial launch a few months ago I've received a lot of feedback on various aspects of the Arpasing framework. This update is a major renovation of the whole system in response to that feedback. The major changes are:
- Triphone/quadphone units are removed.
- "ax" (schwa) is added to the phone set.
Arpasing recording script:
- The N-gram coverage script is redesigned.
- Extension script "isolated vowels" is added.
- Bug fixes.
- An option is added to force the note duration to be uniform (explained in detail in
Arpasing toolkit (version 0.2) can be downloaded from here.
I'm also going to update Moresampler very soon.
As promised a year ago, I've updated the libllsm repository to the latest version used by Moresampler.
There are quite a few changes from the legacy version, in particular the two-layer representation of the speech signal. The legacy libllsm is now called layer 0 LLSM, and a layer 1 model has been added. The layer 1 model is an acoustic re-interpretation of the harmonic component in layer 0, and the conversion from layer 0 to layer 1 involves a new algorithm, which I prefer to call Pseudo Glottal Inverse Filtering (PGIF), that approximately inverts the speech production process. All these changes stick to the principles of speech production and enable more acoustically meaningful modification of speech.
More details can be found in
Thanks to @OcarinaLink24's report, I found a severe bug in the recently upgraded pitch estimator, where a standard deviation value was mistakenly treated as a variance. This resulted in strong noise being added to the input before running pitch estimation, which tremendously reduced the accuracy of pitch and voicing detection. This bug fix is the only change in Moresampler 0.8.1, but it is a very important one.
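To see why this kind of mix-up can make the added noise much stronger, consider the following minimal sketch. The value of `sigma` is hypothetical and is not the one used in Moresampler; the point is only that when a small standard deviation is read as a variance, the noise generator ends up using its square root as the standard deviation.

```python
import math

def effective_std_when_read_as_variance(sigma: float) -> float:
    """Standard deviation actually used if sigma is mistaken for a variance."""
    return math.sqrt(sigma)

sigma = 1e-3  # hypothetical intended standard deviation of the added noise
effective = effective_std_when_read_as_variance(sigma)
excess_db = 20.0 * math.log10(effective / sigma)
print(f"intended std {sigma}, effective std {effective:.4f}, "
      f"{excess_db:.1f} dB too strong")
```

For any intended standard deviation below 1, the square root is larger than the value itself, so the noise comes out stronger than intended.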
0.8.1 (Mar. 29, 2017) Download
- Bug fix: a severe bug in the recently upgraded pitch estimator.
A few months ago I uploaded a document proposing a new English UTAU recording script with detailed specifications. As an attempt to replicate a unit-selection based speech synthesizer in UTAU, the new standard was named Arpasing. Thanks to a few users who recorded the very first Arpasing voicebanks in spite of the lack of clear instructions, we're now able to further explore this uncharted land with Moresampler 0.8.0's built-in support for Arpasing oto generation. Here I'm launching another tool, this time for actually creating USTs with Arpasing.
Please keep in mind that Arpasing is an experiment, and we don't yet know if it's going to work well, until more efforts are made to revise the tools & voicebanks.
Here is the long-awaited Moresampler 0.8.0.
Moresampler 0.7.2 is in the transition between 0.7.x and 0.8.x - Arpasing is not supported yet but some oto-generation features such as loading files from index.csv are implemented. There are some code refactoring & bug fixes as well.
The oto generator is now able to generate CVVC oto entries. Regardless of the format of the input (hiragana/romaji), it can output in either format. Once a voicebank is loaded, a prompt in the command line window asks for the output format.
In response to feedback that Moresampler's oto generator creates a lot of redundant (in a diphone synthesis sense) unit aliases, I've added an option that only updates the existing entries in a given oto.ini file, without creating any new entries. Note that to enable this feature you need to have a "vanilla" oto.ini under the voicebank directory first.
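The update-only behaviour can be pictured with the sketch below. It assumes the commonly used oto.ini line layout `file.wav=alias,offset,consonant,cutoff,preutterance,overlap` and assumes that entries are matched by file name plus alias; how Moresampler actually matches entries is not stated here, and the function names and sample lines are made up for illustration.

```python
def parse_oto_line(line):
    """Split one oto.ini line into ((wav, alias), [timing fields])."""
    wav, _, rest = line.partition("=")
    fields = rest.split(",")
    return (wav, fields[0]), fields[1:]

def update_existing_only(existing_lines, generated_lines):
    """Refresh timing fields of existing entries; never add new aliases."""
    generated = dict(parse_oto_line(line) for line in generated_lines)
    merged = []
    for line in existing_lines:
        (wav, alias), params = parse_oto_line(line)
        # Keep the old values if the generator produced nothing for this entry.
        params = generated.get((wav, alias), params)
        merged.append(f"{wav}={alias}," + ",".join(params))
    return merged

# Hypothetical "vanilla" oto.ini with placeholder timings.
vanilla = ["ka.wav=ka,0,0,0,0,0", "sa.wav=sa,0,0,0,0,0"]
# Hypothetical generator output, including an extra alias we do not want.
generated = ["ka.wav=ka,120,80,-300,60,20", "ka.wav=a k,10,40,-120,30,10"]
print(update_existing_only(vanilla, generated))
```

In this sketch the extra alias `a k` is dropped because it has no counterpart in the vanilla file, and `sa` keeps its placeholder values because nothing was generated for it.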
Shortly after the release of the Arpasing proposal, Adlez recorded a test voicebank and sent it back to me in almost no time. I'd like to thank Uchuu and BagHeadChan for their contributions as well. That data really helped me work out a roughly working solution to voicebank labeling, so I could start actually making songs with Arpasing voicebanks.
Here are two short samples I've created with Adlez's voicebank:
Unravel (English ver, lyrics by Lucy)
("Oh won't you tell me, please just tell me, explain how this should work. I fear who could it be, that lives inside of me. My conscience cracking, mind reacting, surrounded by the world. But here you're smiling bright, completely blind to life.")
Tokio Funka (English ver, lyrics by Jayn)
("In a hazy town with deception all around, there's a dusty cloud that drifts downward to the ground.")