TTS

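TTS (Text to Speech) converts written text into synthesized speech for listening. It is used in virtual assistants, audiobooks, voice guidance systems, and similar applications, letting users access text information by listening to it. TTS also improves information accessibility and is useful for producing a variety of multimedia content.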

Supported languages: Korean, English

Available Models

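| Model | Description |
| --- | --- |
| tts-251203 | Generates an entirely new voice while naturally reflecting the style and mood of the voice the user provides. Supply the voice you want and the model speaks in that style: a fast delivery stays fast and a bright tone stays bright, with no additional settings. |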

If the model is specified without version information, the latest model is called.
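The naming rule above can be sketched as a small helper. This is only an illustration: the function name `model_name` is hypothetical and not part of the API, and it assumes the `tts-{version}` pattern shown in the request examples.

```python
from typing import Optional


def model_name(version: Optional[str] = None) -> str:
    """Return the value to send in the request's "model" field (hypothetical helper)."""
    # Without a version, send the bare name "tts"; the service then calls the latest model.
    if not version:
        return "tts"
    # With a version, follow the tts-{version} pattern, e.g. tts-251203.
    return f"tts-{version}"
```

Calling `model_name()` gives the unpinned name, while `model_name("251203")` pins the model listed in the table.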

Request

| Method | Endpoint |
| --- | --- |
| POST | https://aiplatform-api.rest.univa.co.kr/rest/v1/audio/tts |

Request Headers

| Name | Value |
| --- | --- |
| Content-Type | application/json |
| x-api-key | UNIVA-API-KEY |

# Replace UNIVA-API-KEY with your API key.
# To pin a model version, set "model" to tts-{version}.
# speaker_id options: voice1, voice2, voice3, voice4
curl -X POST 'https://aiplatform-api.rest.univa.co.kr/rest/v1/audio/tts' \
  -H 'Content-Type: application/json' \
  -H 'x-api-key: UNIVA-API-KEY' \
  -d '{"data": "User input text","model": "tts","options": {"stream": false,"speaker_id": "voice1","speed": 1,"segment_gap": 0.05}}'

const axios = require('axios')

const apiKey = 'UNIVA-API-KEY' // Enter your API key.
const apiUrl = 'https://aiplatform-api.rest.univa.co.kr/rest/v1/audio/tts'
const data = {
  model: 'tts',
  // To pin a version, use tts-{version}
  data: 'User input text',
  options: {
    stream: false,
    speaker_id: 'voice1',
    // speaker_id options: voice1, voice2, voice3, voice4
    speed: 1,
    segment_gap: 0.05,
  },
}

async function ttsPostRequest() {
  try {
    // Await the request so that the try/catch below handles failures.
    const response = await axios.post(apiUrl, data, {
      headers: {
        'Content-Type': 'application/json',
        'x-api-key': apiKey,
      },
    })
    console.log('Response status:', response.status)
    console.log('Response data:', response.data)
  } catch (error) {
    // error.response is set when the server replied with an error status.
    console.error(
      'Error:',
      error.response ? error.response.data : error.message
    )
  }
}

ttsPostRequest()

import org.apache.http.HttpEntity;
import org.apache.http.HttpResponse;
import org.apache.http.client.methods.HttpPost;
import org.apache.http.entity.ContentType;
import org.apache.http.entity.StringEntity;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.client.HttpClients;
import org.apache.http.util.EntityUtils;
import java.io.IOException;

public class TTSPostExample {
    public static void main(String[] args) {
        String apiKey = "UNIVA-API-KEY"; // Enter your API key.
        String url = "https://aiplatform-api.rest.univa.co.kr/rest/v1/audio/tts";

        CloseableHttpClient httpClient = HttpClients.createDefault();

        HttpPost httpPost = new HttpPost(url);
        httpPost.setHeader("Content-Type", "application/json");
        httpPost.setHeader("x-api-key", apiKey);

        // Request body as a JSON string
        String json = "{ \"model\": \"tts\",\n\"data\": \"User input text\",\n\"options\": {\n\"stream\": false, \"speaker_id\": \"voice1\", \n\"speed\": 3,\n \"segment_gap\": 0.1}}";
        // To pin a version, use tts-{version}
        // speaker_id options: voice1, voice2, voice3, voice4

        StringEntity entity = new StringEntity(json, ContentType.APPLICATION_JSON);

        httpPost.setEntity(entity);

        try {
            HttpResponse response = httpClient.execute(httpPost);
            HttpEntity responseEntity = response.getEntity();
            String responseString = EntityUtils.toString(responseEntity, "UTF-8");
            System.out.println(responseString);
        } catch (IOException e) {
            e.printStackTrace();
        } finally {
            try {
                httpClient.close();
            } catch (IOException e) {
                e.printStackTrace();
            }
        }
    }
}

import requests
import json

url = "https://aiplatform-api.rest.univa.co.kr/rest/v1/audio/tts"
api_key = "UNIVA-API-KEY"  # Enter your API key.

headers = {
    "Content-Type": "application/json",
    "x-api-key": api_key
}

data = {
    "data": "User input text",
    "model": "tts",
    # To pin a version, use tts-{version}
    "options": {
        "stream": False,
        "speaker_id": "voice1",
        # speaker_id options: voice1, voice2, voice3, voice4
        "speed": 1,
        "segment_gap": 0.05
    }
}

# Send the JSON-encoded body; the audio bytes are available as response.content.
response = requests.post(url, headers=headers, data=json.dumps(data))

Request Body (* required)

| Name | Type | Description |
| --- | --- | --- |
| data* | string | User input text |
| options["stream"] (default: false) | bool | Determines how the audio is output. When set to true, the generated audio is returned in chunks. |
| options["speaker_id"] (default: voice1) | string | Voice option for the generated audio: voice1, voice2, voice3, voice4 |
| options["speed"] (default: 1) | number | Playback speed of the generated audio |
| options["segment_gap"] (default: 0.05) | number | Gap between utterances in the generated audio |
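As a rough illustration of how these fields fit together, the sketch below assembles a request body using the documented defaults. The helper name `build_tts_payload` and its client-side checks are assumptions made for this example. The service's own validation rules are not documented here.

```python
from typing import Any, Dict

# Voice options listed in the request body table.
SPEAKER_IDS = ("voice1", "voice2", "voice3", "voice4")


def build_tts_payload(
    text: str,
    model: str = "tts",
    stream: bool = False,
    speaker_id: str = "voice1",
    speed: float = 1,
    segment_gap: float = 0.05,
) -> Dict[str, Any]:
    """Build the JSON body for the TTS endpoint (hypothetical helper)."""
    # "data" is the only field marked as required.
    if not text:
        raise ValueError("data (the input text) is required")
    # Client-side check only; how the server treats other values is not documented.
    if speaker_id not in SPEAKER_IDS:
        raise ValueError(f"speaker_id must be one of {SPEAKER_IDS}")
    return {
        "data": text,
        "model": model,
        "options": {
            "stream": stream,
            "speaker_id": speaker_id,
            "speed": speed,
            "segment_gap": segment_gap,
        },
    }
```

The returned dictionary can be passed to an HTTP client as the JSON body, for example `build_tts_payload("User input text", speaker_id="voice2")`.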

Response

The response is returned in one of two data formats, depending on the stream option.
* Sample rate: 16000 Hz

options["stream"] = false
  - Binary data in WAV format

options["stream"] = true
  - Binary data in PCM format
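The two response formats call for slightly different handling on the client. The sketch below saves either one as a playable WAV file, using only the Python standard library (`urllib` rather than `requests`). The 16000 Hz sample rate comes from this page. The mono, 16-bit PCM layout is an assumption, since the sample format is not stated here, and the function names are hypothetical.

```python
import json
import urllib.request
import wave

SAMPLE_RATE = 16000  # documented sample rate
CHANNELS = 1         # assumption: mono audio
SAMPLE_WIDTH = 2     # assumption: 16-bit samples (2 bytes each)


def pcm_to_wav(pcm: bytes, path: str) -> None:
    """Wrap raw PCM bytes in a WAV container so common players can open them."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)


def synthesize_to_file(url: str, api_key: str, payload: dict, path: str) -> None:
    """POST the payload and save the audio to path (hypothetical helper)."""
    stream = payload.get("options", {}).get("stream", False)
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=60) as response:
        if stream:
            # stream = true: read the PCM chunks as they arrive, then add a WAV header.
            chunks = iter(lambda: response.read(4096), b"")
            pcm_to_wav(b"".join(chunks), path)
        else:
            # stream = false: the body is already a complete WAV file.
            with open(path, "wb") as f:
                f.write(response.read())
```

For low-latency playback, the streamed chunks would normally be handed to an audio device as they arrive. Joining them first, as done here, is only to keep the example short.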

For errors returned by API calls, see the API Error code page.

