Talk:Text to Speech: Difference between revisions

From Valve Developer Community
Jump to navigation Jump to search
No edit summary
(Help! Errors!)
Line 1: Line 1:
Interesting. Do you have any sound-samples? --[[User:Dutchmega|dutchmega]] 10:34, 14 Aug 2006 (PDT)
Interesting. Do you have any sound-samples? --[[User:Dutchmega|dutchmega]] 10:34, 14 Aug 2006 (PDT)
:Dutchmega, in my user page is a link to a [http://infolab.northwestern.edu/#projects/40---people/116 News at Seven] page that has a video of our current system; it uses the same TTS stuff as the code in this page.  You can watch the video (it's less than 20 megs, I believe) to get a good idea of how good the voices can sound.  The only modification we're doing right now to the stock SAPI speech is a small, hardcoded dictionary of transformations for hard-to-pronounce phrases (ie, 'al Qaeda' becomes 'all chi de', etc.)  In the future, we have plans to vary the speech produced based on the content/affectation of the text, but right now it's just basic TTS.  If I get a chance, I'll post some actual sound samples later.  Thanks.--[[User:Ndnichols|Ndnichols]] 07:05, 21 Aug 2006 (PDT)
:Dutchmega, in my user page is a link to a [http://infolab.northwestern.edu/#projects/40---people/116 News at Seven] page that has a video of our current system; it uses the same TTS stuff as the code in this page.  You can watch the video (it's less than 20 megs, I believe) to get a good idea of how good the voices can sound.  The only modification we're doing right now to the stock SAPI speech is a small, hardcoded dictionary of transformations for hard-to-pronounce phrases (ie, 'al Qaeda' becomes 'all chi de', etc.)  In the future, we have plans to vary the speech produced based on the content/affectation of the text, but right now it's just basic TTS.  If I get a chance, I'll post some actual sound samples later.  Thanks.--[[User:Ndnichols|Ndnichols]] 07:05, 21 Aug 2006 (PDT)
== Help! Errors! ==
I followed the instuctions on the page (I installed all of the required stuff).
I imported the hl2TTS module into python, but when i went to do the creation of the .wav (hl2TTS.StraightForwardRecord("The quick brown fox jumped over the lazy dog.", "c:\\test")) i get the error
"Traceback (most recent call last):
  File "<interactive input>", line 1, in ?
  File "hl2TTS.py", line 331, in StraightForwardRecord
    RecordWAV(safeFilename, text)
  File "hl2TTS.py", line 79, in RecordWAV
    wordTimings = GetTimings(text)
  File "hl2TTS.py", line 98, in GetTimings
    wordTimings[-1].append([event.CurrentID, event.NextID, event.Duration])
IndexError: list index out of range"
The .wav is created, but it is not linked with any phenomes when I open it with FacePoser.

Revision as of 20:37, 22 September 2006

Interesting. Do you have any sound-samples? --dutchmega 10:34, 14 Aug 2006 (PDT)

Dutchmega, in my user page is a link to a News at Seven page that has a video of our current system; it uses the same TTS stuff as the code in this page. You can watch the video (it's less than 20 megs, I believe) to get a good idea of how good the voices can sound. The only modification we're doing right now to the stock SAPI speech is a small, hardcoded dictionary of transformations for hard-to-pronounce phrases (ie, 'al Qaeda' becomes 'all chi de', etc.) In the future, we have plans to vary the speech produced based on the content/affectation of the text, but right now it's just basic TTS. If I get a chance, I'll post some actual sound samples later. Thanks.--Ndnichols 07:05, 21 Aug 2006 (PDT)

Help! Errors!

I followed the instuctions on the page (I installed all of the required stuff).

I imported the hl2TTS module into python, but when i went to do the creation of the .wav (hl2TTS.StraightForwardRecord("The quick brown fox jumped over the lazy dog.", "c:\\test")) i get the error

"Traceback (most recent call last):

 File "<interactive input>", line 1, in ?
 File "hl2TTS.py", line 331, in StraightForwardRecord
   RecordWAV(safeFilename, text)
 File "hl2TTS.py", line 79, in RecordWAV
   wordTimings = GetTimings(text)
 File "hl2TTS.py", line 98, in GetTimings
   wordTimings[-1].append([event.CurrentID, event.NextID, event.Duration])

IndexError: list index out of range"

The .wav is created, but it is not linked with any phenomes when I open it with FacePoser.