IJNRD Research Journal

WhatsApp
Click Here

WhatsApp editor@ijnrd.org
IJNRD
INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT
International Peer Reviewed & Refereed Journals, Open Access Journal
ISSN Approved Journal No: 2456-4184 | Impact factor: 8.76 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.76 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Call For Paper

For Authors

Forms / Download

Published Issue Details

Editorial Board

Other IMP Links

Facts & Figure

Impact Factor : 8.76

Issue per Year : 12

Volume Published : 9

Issue Published : 96

Article Submitted :

Article Published :

Total Authors :

Total Reviewer :

Total Countries :

Indexing Partner

Join RMS/Earn 300

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Published Paper Details
Paper Title: Text-To-Speech Synthesizer and Voice Cloning using Generative Model
Authors Name: Ravishek Kumar Singh , Himanshu Pal , Rohit Raj , Mohd Tariq , Sandeep kumar
Download E-Certificate: Download
Author Reg. ID:
IJNRD_194218
Published Paper Id: IJNRD2305167
Published In: Volume 8 Issue 5, May-2023
DOI:
Abstract: We present a neural network- based text-to-speech (TTS) synthesis system that can synthesise spoken sounds in the voices of many speakers. Our system is made up of three independently trained components: a speaker encoder network that was trained on a speaker verification task using an independent dataset of noisy speech without transcripts from thousands of speakers to generate a fixed-dimensional embedding vector from only seconds of reference speech from a target speaker; a Tacotron- based sequence-to-sequence synthesis network that generates a model spectrogram from text, conditioned on the speaker embedding; and we show that the proposed model can transfer the discriminatively-trained speaker encoder' s knowledge of speaker variability to the multispeaker TTS challenge and synthesis authentic speech from speakers not observed during t r a i n i n g . To g e t t h e o p t i m u m generalisation performance, we quantify the value of training the speaker encoder on a wide and varied speaker set. Finally, we demonstrate that randomly chosen speaker embeddings can synthesis speech in the voices of fresh speakers who are not comparable to those used in training, showing that the model has learnt a high- quality speaker representation.
Keywords: Tacotron, spectrogram, TTS, embeddings.
Cite Article: "Text-To-Speech Synthesizer and Voice Cloning using Generative Model", International Journal of Novel Research and Development (www.ijnrd.org), ISSN:2456-4184, Vol.8, Issue 5, page no.b522-b531, May-2023, Available :http://www.ijnrd.org/papers/IJNRD2305167.pdf
Downloads: 000118755
ISSN: 2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.76 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Publication Details: Published Paper ID:IJNRD2305167
Registration ID: 194218
Published In: Volume 8 Issue 5, May-2023
DOI (Digital Object Identifier):
Page No: b522-b531
Country: Greater Noida, Uttar Pradesh, India
Research Area: Computer Science & Technology 
Publisher : IJ Publication
Published Paper URL : https://www.ijnrd.org/viewpaperforall?paper=IJNRD2305167
Published Paper PDF: https://www.ijnrd.org/papers/IJNRD2305167
Share Article:
Share

Click Here to Download This Article

Article Preview
Click Here to Download This Article

Major Indexing from www.ijnrd.org
Semantic Scholar Microsaoft Academic ORCID Zenodo
Google Scholar ResearcherID Thomson Reuters Mendeley : reference manager Academia.edu
arXiv.org : cornell university library Research Gate CiteSeerX PUBLON
DRJI SSRN Scribd DocStoc

ISSN Details

ISSN: 2456-4184
Impact Factor: 8.76 and ISSN APPROVED
Journal Starting Year (ESTD) : 2016

DOI (A digital object identifier)


Providing A digital object identifier by DOI
How to Get DOI? DOI

Conference

Open Access License Policy

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Creative Commons License This material is Open Knowledge This material is Open Data This material is Open Content

Important Details

Social Media

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Join RMS/Earn 300

IJNRD