{"id":35321,"date":"2026-03-25T09:58:34","date_gmt":"2026-03-25T16:58:34","guid":{"rendered":"https:\/\/www.podfeet.com\/blog\/?p=35321"},"modified":"2026-03-25T17:06:21","modified_gmt":"2026-03-26T00:06:21","slug":"audiobook-audio-v3-part-2-the-v3-chain-by-eddie-tonkoi","status":"publish","type":"post","link":"https:\/\/www.podfeet.com\/blog\/2026\/03\/audiobook-audio-v3-part-2-the-v3-chain-by-eddie-tonkoi\/","title":{"rendered":"Audiobook Audio (V3) Part 2: The V3 Chain \u2014 by Eddie Tonkoi"},"content":{"rendered":"<h3>The V3 Chain and The Few Decisions I Refuse to Wing<\/h3>\n<p>It\u2019s not long ago I wrote a piece describing my older process\u2014one that was ingenious, thought-through, bullet-proof, and, it turns out, flawed. It\u2019s not that the old process was bad. It\u2019s just that it turned out to be less efficient than I\u2019d thought, and it also turned out I was looking at things backwards. In the past, I recorded my audio, did some processing, edited it, then continued the processing. I had bought iZotope RX11 and it performs near-miracles on poor audio to make it useful.<\/p>\n<p>And the problem with that? It meant I\u2019d accept poor audio and then try to fix it.<\/p>\n<p>But having lived through and discarded a lengthy editing process, I now realise that repairing poor audio doesn\u2019t give as good a result as polishing good audio. Who\u2019d have thought?<\/p>\n<h3>How did I get here?<\/h3>\n<p>I was tidying up my audio, removing mouth clicks, those smacking sounds your mouth can make when it opens and closes, and after a while noticed something odd. Some of my consonants were disappearing as well. I dialled back the strength of the tool, and managed to get to a position that removed some clicks and no consonants. Happy, I moved on.<\/p>\n<p>Then I noticed something else odd. I was removing breath noise, and some of my consonants were disappearing. Again. I dialled back the settings, tweaking them this way and that, but no matter what I did, I couldn\u2019t save the consonants whilst removing the breath noise. In fact, the tool was identifying and cleaning my consonants before even touching the breath.<\/p>\n<p>That\u2019s what made me stop, pause, and reflect.<br \/>\nThat\u2019s when I realised I had this all wrong.<br \/>\nThat\u2019s when I decided to make this series\u2014to push myself to understand properly what I\u2019m doing.<\/p>\n<h3>Where am I?<\/h3>\n<p>I\u2019ve spent several hours learning, testing, and learning some more. I like learning. After that time, I\u2019m now in a place where I believe, from a position of understanding, these three things:<\/p>\n<ol>\n<li>Audio should be recorded as close to the desired output as possible.<\/li>\n<li>Editing should be done on that high-quality audio.<\/li>\n<li>Processing should be kept to a minimum\u2014ideally, only loudness being adjusted to meet specifications.<\/li>\n<\/ol>\n<p>That means, ideally, no RX11 De-click, Voice De-noise, Mouth De-click, De-ess, or even EQ (though that\u2019s probably going a bit too far). In practice, I\u2019m not quite there, but I\u2019m surprisingly close: only a gentle bit of denoising and Mouth De-click early on and a subtle EQ adjustment before Loudness Control.<\/p>\n<h3>More detail on the chain<\/h3>\n<p>It all now starts with me trying to get the best audio I can out of the microphone\u2014which really means getting as close to the final product as I can. I\u2019ll go into that properly in the next couple of segments, because this turns out to be the real lever. For now, all that\u2019s relevant is that I record using Audio Hijack, which is simply a rock-solid app that saves an uncompressed 32-bit, 44.1 kHz audio file.<br \/>\nOh, and I do have one confession.<\/p>\n<p>Going against best practice, as if that\u2019s a solid thing, I put trust in my microphone\u2014the Shure MV7+\u2014and I enable its internal De-noise functionality. Nothing else, but it does mean the audio I record is technically called <strong>dry<\/strong> not <strong>raw<\/strong>, because it has been processed. However, I have really tested this, and I cannot find any problems with its de-noising: no artefacts in the audio\u2014and it reduces background noise. Purists would say I should turn it off and use RX11 instead, so that I hold onto a true raw file, but in blind tests, the MV7+ has seemed to do a better job than RX11. I\u2019m keeping it on.<\/p>\n<p>At this point, I run a gentle Mouth De-click using RX11, though I keep the original file as well in case I notice something I want to recover. It hasn\u2019t happened yet. I actually do this as a plug-in to Audio Hijack as it records, so it doesn\u2019t cost me any time.<\/p>\n<p>I then import the file into Logic Pro, with each chapter being a separate track. Logic has a great function that removes silences, so I choose something quite conservative to get rid of the long pauses I sometimes have when a train goes past, or when I stop to read ahead. The settings I use look for places where the volume stays below -40 dB for at least 1.8 seconds, and then it trims that near-silence to just 1.8 seconds long. It doesn\u2019t do much, but it saves me some time.<\/p>\n<p>Now we\u2019re onto the editing stage, which is lengthy. I listen through on the computer with my good headphones, and do just a few things, ideally:<\/p>\n<ol>\n<li>Remove fluffs\u2014where I said a line incorrectly, paused, and repeated it.<\/li>\n<li>Shorten gaps between sentences or paragraphs, which could be due to me pausing to breathe or read ahead. Many of these will be that 1.8 seconds long now, which is always too long.<\/li>\n<li>Adjust gaps between words and sentences. Especially as it is me speaking, I know the cadence of the text, and so I know when the next word is meant to land. If it noticeably misfires, I go in and make tiny edits to move the word forwards or backwards. It\u2019s almost always forwards.<\/li>\n<li>Reduce the volume of objectionable breathing or knocks. I do this in Logic by reducing the gain around the noise, aiming to make it less objectionable\u2014but not aiming to remove it. Humans do breathe, after all.<\/li>\n<li>Bounce the audio, which means I export it, giving me a rendered version of the edited track.<\/li>\n<li>Adjust the loudness using RX11 to bring it to audiobook specifications.<\/li>\n<li>Listen through on my iPhone, making note of anything that needs adjusting so I can go back into Logic to fix it.<\/li>\n<\/ol>\n<p>Once all that is done, it\u2019s just a question of deciding if the EQ is correct \u2014 or if I want to reduce the bass a little, or suppress a particular frequency (500 Hz and 5000 Hz can be a bit boomy for my voice).<\/p>\n<h3>Where this leaves me<\/h3>\n<p>This audio chain leaves me with a minimal number of tools. I know, it sounded quite long, but I have tried to make it minimal. I\u2019ve stripped out de-essing and breath control, and switched to a light touch with other passes so that I can just run them and not worry about losing things. They&#8217;re simply not aggressive enough to cause problems, which also means they\u2019re not aggressive enough to do much fixing.<br \/>\nBut, here\u2019s the point, what that chain really does is remove excuses\u2014it shifts my time from repairing damage to enhancing something that\u2019s already good.<\/p>\n<p>The goal is simple: <strong>capture close to the target<\/strong>, and do as little as possible afterwards. And what that means is that if I\u2019m not going to repair tone later, then tone has to be right <strong>at capture<\/strong>. And if tone has to be right at capture, then the biggest part of the \u201cprocessing\u201d isn\u2019t a plugin at all\u2014it\u2019s microphone technique.<\/p>\n<p>And <em>that<\/em>, is the subject for the next couple of segments: how I made mic position boring, repeatable, and reliable enough that I could stop guessing.<\/p>\n<p>If you want to know more, come and ask me over in the Slack community at <a href=\"https:\/\/www.podfeet.com\/slack\">podfeet.com\/slack<\/a>, where I and all the other lovely NosillaCastaways enjoy friendly, positive online conversations. Feel free to message me, Eddie Tonkoi, if you have any thoughts, questions, or techniques you&#8217;re using. It would be nice to share ideas.<\/p>\n<p>You can also find our work at <a href=\"https:\/\/jerntonkoi.com\/\">jerntonkoi.com<\/a>, where you\u2019ll find Jern\u2019s character-driven queer love stories, the audiobooks I produce for them, and bonus material for our subscribers.<\/p>\n<p>I\u2019ll be back soon to talk through some more of my workflow, but for now, happy recording, and happy reading.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The V3 Chain and The Few Decisions I Refuse to Wing It\u2019s not long ago I wrote a piece describing my older process\u2014one that was ingenious, thought-through, bullet-proof, and, it turns out, flawed. It\u2019s not that the old process was bad. It\u2019s just that it turned out to be less efficient than I\u2019d thought, and [&hellip;]<\/p>\n","protected":false},"author":34,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[147],"tags":[1604,7881,7887,7864,869,7886,3511,7863,7885],"class_list":["post-35321","post","type-post","status-publish","format-standard","hentry","category-blog-posts","tag-audiobook","tag-audiobook-audio-v3","tag-capture-first","tag-de-essing","tag-microphone","tag-minimal-processing","tag-noise","tag-recording-audiobook","tag-recording-chain"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/posts\/35321","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/users\/34"}],"replies":[{"embeddable":true,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/comments?post=35321"}],"version-history":[{"count":10,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/posts\/35321\/revisions"}],"predecessor-version":[{"id":35649,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/posts\/35321\/revisions\/35649"}],"wp:attachment":[{"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/media?parent=35321"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/categories?post=35321"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.podfeet.com\/blog\/wp-json\/wp\/v2\/tags?post=35321"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}