Show HN: I made a tool using AI to convert articles to audio

40 points by Paul_Grsl 3 years ago

nceasy 3 years ago

Hey nice one! I'm trying to innovate in the STT and TTS space as well (completely different field), feel free to contact me in my profile email if you want to exchange some knowledge :). I hear some bugs in portuguese, but I'm guessing the trained model has some issues. Congrats and Good luck with your tool! Ps: give users some preview of the speakers voice so we can test it before convert audio, it can save you some resources.

Paul_Grsl 3 years ago

Hello, we can meet on twitter: grsl_en :) thanks for your comment, it's still a v1 in beta but I will continue to improve it. And yes as you can see in the ui there are buttons for preview but not yet functional :)

trabant00 3 years ago

It does not handle apostrophes good. It says "we L L" instead of we'll or "Andrea S" instead of Andrea's. It also has some problem with pacing around dashes, three points and quotes. It speeds up a lot and connects the words before and after those. Overall would not use it in this state to turn articles into podcasts or something like that.

capableweb 3 years ago

What about something that can do the opposite? Like converting video and/or audio to articles?

Most of the content I consume fits best (for me) in article format, so I can read it at my own speed, but some really good information can only (annoyingly) be found in videos or podcasts.

roryisok 3 years ago

I've been using whisper from openAI to transcribe stuff this month, and its incredibly accurate. would be a good base for something like this
Paul_Grsl 3 years ago

This is a good idea, I just started the project :) So this is a feature that can be added in the future.
Here is the roadmap for the future: - Audio sharing - Convert Text To Audio - Convert PDF To Audio - Convert Photo To Audio - Chrome extension - to convert while browsing - Mobile App - to manage audios everywhere, simply
and adding the possibility to do the opposite is also a great idea!
cblavier 3 years ago

Working on this very topic right now! But specific to Podcast audio content
https://readable.fm/
veb 3 years ago

oh mate I so agree and as a deaf person this would be a godsend. way too much shit is in videos or podcasts. please just let me read...

kretaceous 3 years ago

I got super psyched to try this. Always wanted a good TTS extension or app.

However I cannot get it to work. I've logged in, input an article but nothing happens after I click "Convert article to audio" or preview.

Linux Mint/Firefox 107/Chrome

Edit:

I checked devtools and it shows a 500 error with the message "Something is broken. Please let us know what you did"

The link I was trying to convert was:

https://www.daemonology.net/blog/2020-09-20-On-the-use-of-a-...

westcort 3 years ago

Here is a bookmarklet that does TTS: https://locserendipity.com/Speaker.html
Also: https://www.locserendipity.com/TTS.html
- Paul_Grsl 3 years ago
  
  Except that the quality of the audio rendering is... not crazy.
  
  westcort 3 years ago
  
  It defaults to whatever your browser defaults to. It is easy to change that setting in Chrome to a more natural voice: https://support.google.com/chromebook/answer/11221616?hl=en
Paul_Grsl 3 years ago

I'm looking into how this can happen. Feel free to sign up to come back later and convert your article! I can't wait for you to use Article.Audio

akuji1993 3 years ago

Hey, your tool is not working great with german articles. It can't pronounce Umlauts and also has trouble with some pretty standard simple words.

I used this article as a test: https://www.saarbruecker-zeitung.de/saarland/landespolitik/s...

Paul_Grsl 3 years ago

I will look at why the charactere that Umlauts are not well pronounced. Do you have examples of other words? I will investigate :)
Danke für dein Feedback, das hilft mir wirklich weiter .
- cauners 3 years ago
  
  On the same topic, in Latvian all the characters with diacritics are stripped out completely. For example, "iedzīšana" is pronounced as "iedzana", making the audio pretty funny, though hard to understand.
  
  Paul_Grsl 3 years ago
  
  I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!

nkoren 3 years ago

Huh, wow, really cool! For the most part it's excellent (in English), however I notice that quotation marks (both single and double) are handled strangely. The leading quotation mark is pronounced as a short A, and the trailing as a long A. This can be rather confusing! But otherwise, I'm incredibly impressed by the results.

Paul_Grsl 3 years ago

Thank you very much for your comment!! It's still a beta version and I have to improve some pronunciations! Especially for the special characters... don't hesitate to signup to be kept informed :)
- wazoox 3 years ago
  
  It's certainly that all UTF-8 characters (like UTF-8 fancy quotes and double-quotes) aren't properly interpreted, everything seems to go through as ISO8859-15.
malshe 3 years ago

I also noticed the same issue with quotation marks. But other than that this is a really nice application

schreon 3 years ago

Nice! The german version breaks Umlauts though. Apparently the preprocessing converts e.g. "ä" to "ae", "ö" to "oe" and so on and the text2voice model subsequently pronounces them as e.g. "a-e", instead of what "ä" would actually sound.

Paul_Grsl 3 years ago

Indeed we have a problem with special characters in German (and also Polish… :( ) I am investigating why they are not pronounced correctly.
I studied German at school but not enough.
Vielen Dank für Ihren Kommentar, er hilft mir, das Tool zu verbessern :)

hutrdvnj 3 years ago

Could you make a paid TTS engine App on Android and iOS?

Paul_Grsl 3 years ago

Yes it's in the roadmap. and also a chrome extension :) would you be interested ?
- hutrdvnj 3 years ago
  
  Yes, I currently use the Read Aloud Android App and it allows me to use any installed TTS. The Google free network TTS voices are quite okay, but I know that there are better premium voices unfortunately I didn't found any high quality human like TTS in the play store as of yet.

tiffanyh 3 years ago

Dumb question: how is “AI” used for text-to-speech?

Paul_Grsl 3 years ago

Text-to-Speech (TTS) technology uses artificial intelligence (AI) translate written information in a given language into a sound, voice or speech with a human accent.to learn the AI had to learn with many parameters, so that the pronunciation improves from version to version :)

aronatom 3 years ago

Tested icelandic! sound really good except for ignoring all special icelandic character such as ð, ó, á ,ö and so on

Paul_Grsl 3 years ago

Thanks for your comment! I'm actually working on this point to fix it as FAST as possible! sorry for that...
roland_szabo 3 years ago

I had the same issue with hungarian language.
- Paul_Grsl 3 years ago
  
  I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!
  
  aronatom 3 years ago
  
  I will! Great stuff

judex 3 years ago

Great work! I'm testing if I could use it in my project. It would be good to be able to just paste some text.

Paul_Grsl 3 years ago

Thank you so much! What's your project?
Here is the roadmap for the future: · Audio sharing ----> FOR YOU · Convert Text To Audio · Convert PDF To Audio · Convert Photo To Audio · Chrome extension - to convert while browsing · Mobile App - to manage audios everywhere, simply

jerpint 3 years ago

I had this idea a few months ago, obviously I never got around to executing it. I’m glad someone else did

Paul_Grsl 3 years ago

That famous moment when you have the idea of a new side project, you buy the domain name and then... you have a new idea (loop).
This time I developed it, I'm happy with this 1st version (which must be improved).
Anyway, thanks for your comment
And you, why didn't you develop it in the end?
- jerpint 3 years ago
  
  I can’t remember the list of thousands of excuses I came up with :)
  
  Paul_Grsl 3 years ago
  
  Haha. destroy this list and go for it :D

5amdotis 3 years ago

I would like to have a tool that does it the other way around. Audio to a somewhat cohesive article.

wazoox 3 years ago

I can only get "200 internal server error" entries in the dev console :)

Paul_Grsl 3 years ago

Wow... :( Still now ?
- wazoox 3 years ago
  
  It works on Chromium though. However it has trouble with UTF-8 obviously: it interprets "é" as "Ã©" i.e. ISO8859-15.
  I see that for "French" it selects "Alain" as a voice in Chromium, but "Joe" in Firefox. However in Firefox I must first change the language, it then select another voice, and switching back to French, it switches properly to Alain then it works. It probably doesn't initialise properly the voice selector when loading the page in Firefox (if I don't change the language first, I can't select any voice, the selector isn't working).
  Same problem with accents as Chromium though :)

herinskd 3 years ago

It works amazingly fast, are you using GPUs and QNNX to reach such performance?

Paul_Grsl 3 years ago

Thanks for the great feedback. I think we can still improve the result, it's only a v1 in beta. I use nothing very advanced for rendering, a stack and tools rather simple. :)
zachthewf 3 years ago

I believe it's using Microsoft TTS voices (at least for some of them)
- Paul_Grsl 3 years ago
  
  That's right we use Azure TTS! :)

phas0ruk 3 years ago

Nice, what’s the tech stack?

Paul_Grsl 3 years ago

Simple: PHP (Symfony), JS (Vanilla), HTML, TailwindCSS :)

loriverkutya 3 years ago

Not usable with Hungarian.

Paul_Grsl 3 years ago

I'm looking into it, a Polish user is having the same problems. Don't hesitate to sign up so I can let you know when it's fixed :)
Köszönöm a hozzászólásodat, ez segít nekem! :)

fieryskiff11 3 years ago