A blog about automation and technologies in the cloud

2023-12-13T21:33:29+00:00

Hi,

This script works perfectly. I have added a few small improvements of my own, but I am struggling with one issue: I can’t get Croatian diacritical marks such as “Č” or “Š” pronounced in the output audio file, because the Speech API reads them as plain “C” or “S”, which is incorrect. How can I solve this?


2023-12-14T07:36:30+00:00

I had the same issue with Norwegian characters like æøå. I haven't looked into it any further. Didn't OpenAI release new text-to-speech AI models a few weeks ago? I haven't checked whether they are available in Azure OpenAI yet.
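
One possible cause of the diacritics problem described in these comments: Windows PowerShell 5.1 does not send a string -Body as UTF-8 unless a charset is given, so letters outside Latin-1 may be altered before they reach the service. The following is a minimal sketch of a workaround, assuming that is the cause: send the SSML as UTF-8 bytes and declare the charset. The helper name Send-Ssml and the voice hr-HR-GabrijelaNeural are assumptions, not part of the post; check the voice list for your region.

```powershell
# Build the sample text from code points so it does not depend on how this script file is encoded.
$Text = "$([char]0x010C)okolada i $([char]0x0161)ljive"   # "Čokolada i šljive"

# Assumption: a Croatian voice and matching xml:lang; adjust to a voice available to you.
$Ssml = "<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='hr-HR'>" +
        "<voice name='hr-HR-GabrijelaNeural'>$Text</voice></speak>"

# Encode explicitly as UTF-8 so the web cmdlet sends these exact bytes.
$BodyBytes = [System.Text.Encoding]::UTF8.GetBytes($Ssml)

# Hypothetical wrapper around the same REST call the post uses; it is not invoked here.
function Send-Ssml {
    param(
        [Parameter(Mandatory)][string]$Uri,
        [Parameter(Mandatory)][hashtable]$Headers,
        [Parameter(Mandatory)][byte[]]$BodyBytes,
        [Parameter(Mandatory)][string]$OutFile
    )
    Invoke-RestMethod -Method Post -Uri $Uri -Headers $Headers `
        -ContentType 'application/ssml+xml; charset=utf-8' `
        -Body $BodyBytes -OutFile $OutFile
}
```

With the variables from the post's scripts, a call could look like Send-Ssml -Uri $uri -Headers $MyHeader -BodyBytes $BodyBytes -OutFile 'croatian.mp3'.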


	$AzureSpeechSubscriptionKey = 'enter your key here'
	$AzureSpeechRegion = 'norwayeast'
	$Language = 'en-us'
	$VoiceName = 'en-US-JennyNeural'
	$Style = 'whispering'

	$FetchTokenHeader = @{
	'Content-type'='application/x-www-form-urlencoded';
	'Content-Length'= '0';
	'Ocp-Apim-Subscription-Key' = $AzureSpeechSubscriptionKey
	}

	$OAuthToken = Invoke-RestMethod -Method POST -Uri "https://$AzureSpeechRegion.api.cognitive.microsoft.com/sts/v1.0/issueToken" -Headers $FetchTokenHeader

	# show the token received
	$OAuthToken


	$MyHeader = @{"Authorization" = "Bearer $OAuthToken";
	"X-Microsoft-OutputFormat" = "audio-16khz-128kbitrate-mono-mp3" }
	$uri = "https://$AzureSpeechRegion.tts.speech.microsoft.com/cognitiveservices/v1"





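	# Speaking styles are set with <mstts:express-as>, which needs the mstts namespace; <voice> itself has no style attribute.
	$Body = @"
	<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xmlns:mstts='https://www.w3.org/2001/mstts' xml:lang='$Language'>
	<voice name="$VoiceName">
	<mstts:express-as style="$Style" styledegree="2">
	Hi, my name is Jenny. I am a neural voice. This is what I sound like when I have an American voice and I'm whispering.
	</mstts:express-as>
	</voice>
	</speak>
	"@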

	# The requested output format is MP3, so save the file with a matching extension.
	Invoke-RestMethod -Method Post -ContentType "application/ssml+xml" -Headers $MyHeader -Body $Body -Uri $uri -OutFile "audio1.mp3"

	$AzureSpeechSubscriptionKey = 'enter your key here'
	$AzureSpeechRegion = 'norwayeast'
	$Language = 'en-IE'
	$VoiceName = 'en-IE-EmilyNeural'

	$FetchTokenHeader = @{
	'Content-type'='application/x-www-form-urlencoded';
	'Content-Length'= '0';
	'Ocp-Apim-Subscription-Key' = $AzureSpeechSubscriptionKey
	}

	$OAuthToken = Invoke-RestMethod -Method POST -Uri "https://$AzureSpeechRegion.api.cognitive.microsoft.com/sts/v1.0/issueToken" -Headers $FetchTokenHeader

	# show the token received
	$OAuthToken


	$MyHeader = @{"Authorization" = "Bearer $OAuthToken";
	"X-Microsoft-OutputFormat" = "audio-16khz-128kbitrate-mono-mp3" }
	$uri = "https://$AzureSpeechRegion.tts.speech.microsoft.com/cognitiveservices/v1"




	$Body = @"
	<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='$Language'>
	<voice name="$VoiceName">
	Hi, my name is Emily. I am a neural voice. This is what I sound like when I'm using an Irish voice with no voice style.
	</voice>
	</speak>
	"@

	# The requested output format is MP3, so save the file with a matching extension.
	Invoke-RestMethod -Method Post -ContentType "application/ssml+xml" -Headers $MyHeader -Body $Body -Uri $uri -OutFile "audio1.mp3"
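
The two scripts above differ only in voice, language, and whether a speaking style is applied. Here is a minimal sketch of how the SSML could be built by a single function. New-SsmlBody and its parameters are hypothetical names, not part of the post. It assumes styles are expressed with mstts:express-as, as in Azure's SSML reference, and it escapes the text so that characters like & do not break the XML.

```powershell
# Hypothetical helper (not from the post): builds the SSML body, with or without a style.
function New-SsmlBody {
    param(
        [Parameter(Mandatory)][string]$Language,
        [Parameter(Mandatory)][string]$VoiceName,
        [Parameter(Mandatory)][string]$Text,
        [string]$Style,
        [string]$StyleDegree = '1'
    )
    # Escape &, <, > and quotes so arbitrary text stays valid XML.
    $Content = [System.Security.SecurityElement]::Escape($Text)
    if ($Style) {
        # Wrap the text in express-as only when a style was requested.
        $Content = "<mstts:express-as style='$Style' styledegree='$StyleDegree'>$Content</mstts:express-as>"
    }
    "<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' " +
    "xmlns:mstts='https://www.w3.org/2001/mstts' xml:lang='$Language'>" +
    "<voice name='$VoiceName'>$Content</voice></speak>"
}

# Usage: the Irish voice without a style, then the whispering American voice.
$PlainBody  = New-SsmlBody -Language 'en-IE' -VoiceName 'en-IE-EmilyNeural' -Text 'Hi, my name is Emily.'
$StyledBody = New-SsmlBody -Language 'en-US' -VoiceName 'en-US-JennyNeural' -Text 'Hi, my name is Jenny.' -Style 'whispering' -StyleDegree '2'
```

Either result can be passed as -Body to the Invoke-RestMethod call used in the scripts.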


Azure AI Speech Service and PowerShell

2 thoughts on “Azure AI Speech Service and PowerShell”
