Combination of Long-Term and Short-Term Features for Age Identification from Voice

doi:10.4316/AECE.2018.02013

PUBLISHER

Stefan cel Mare
University of Suceava

Faculty of Electrical Engineering and
Computer Science

13, Universitatii Street
Suceava - 720229
ROMANIA

Print ISSN: 1582-7445
Online ISSN: 1844-7600
WorldCat: 643243560
doi: 10.4316/AECE

BUYUK, O., ARSLAN, M. L.

Author keywords
feature extraction, Gaussian mixture model, neural networks, speech processing, support vector machines

About this article
Date of Publication: 2018-05-31
Volume 18, Issue 2, Year 2018, On page(s): 101 - 108
ISSN: 1582-7445, e-ISSN: 1844-7600
Digital Object Identifier: 10.4316/AECE.2018.02013
Web of Science Accession Number: 000434245000013
SCOPUS ID: 85047853422

Abstract

In this paper, we propose using Gaussian mixture model (GMM) supervectors in a feed-forward deep neural network (DNN) for age identification from voice. The GMM is trained with short-term mel-frequency cepstral coefficients (MFCC). The proposed GMM/DNN method is compared with a feed-forward DNN and a recurrent neural network (RNN) that use the MFCC features directly. We also compare it with the classical GMM and GMM/support vector machine (SVM) methods. Baseline results are obtained with a set of long-term features commonly used for age identification in previous studies; a feed-forward DNN and an SVM are trained on these long-term features. All systems are tested on a speech database of 228 female and 156 male speakers. We define three age classes for each gender: young, adult, and senior. In the experiments, the proposed GMM/DNN significantly outperforms all the other DNN types; only the GMM/SVM method performs comparably. Moreover, age identification performance improves significantly when the decisions of the short-term and long-term systems are combined. The combination yields approximately 4% absolute improvement over the best standalone system.
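The two ideas in the abstract, turning a variable-length MFCC sequence into a fixed-length GMM supervector and combining the decisions of two systems, can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say how the supervectors are built or how the decisions are combined, so the sketch assumes relevance-MAP adaptation of the component means of a diagonal-covariance background GMM and a weighted sum of class posteriors. The function names, the relevance factor of 16, the fusion weight, and the toy data are all hypothetical.

```python
import numpy as np


def map_supervector(frames, weights, means, variances, relevance=16.0):
    """Stack MAP-adapted component means into one fixed-length vector.

    frames:    (T, D) short-term features (e.g. MFCCs) of one utterance
    weights:   (K,)   mixture weights of a background GMM
    means:     (K, D) component means
    variances: (K, D) diagonal covariances
    relevance: assumed relevance factor (hypothetical default)
    """
    # Log-density of every frame under every diagonal Gaussian: (T, K)
    diff = frames[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )
    # Component posteriors per frame, normalized in a numerically stable way
    log_post = np.log(weights) + log_gauss
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    n_k = post.sum(axis=0)                       # soft counts, (K,)
    data_mean = (post.T @ frames) / np.maximum(n_k, 1e-10)[:, None]
    alpha = (n_k / (n_k + relevance))[:, None]   # adaptation coefficients
    adapted = alpha * data_mean + (1.0 - alpha) * means
    return adapted.reshape(-1)                   # (K * D,)


def fuse_posteriors(short_term, long_term, weight=0.5):
    """Weighted sum of class posteriors from two systems (assumed rule)."""
    fused = (weight * np.asarray(short_term)
             + (1.0 - weight) * np.asarray(long_term))
    return fused.argmax(axis=1)


# Toy data standing in for a trained background GMM and one utterance.
rng = np.random.default_rng(0)
K, D = 4, 13
ubm_w = np.full(K, 1.0 / K)
ubm_mu = rng.normal(size=(K, D))
ubm_var = np.ones((K, D))
utterance = rng.normal(size=(200, D))

sv = map_supervector(utterance, ubm_w, ubm_mu, ubm_var)
print(sv.shape)  # (52,)

# Columns: young, adult, senior. Rows: two test utterances.
short_scores = [[0.5, 0.3, 0.2], [0.2, 0.5, 0.3]]
long_scores = [[0.2, 0.6, 0.2], [0.1, 0.3, 0.6]]
labels = fuse_posteriors(short_scores, long_scores)
print(labels.tolist())  # [1, 2]
```

In this toy case the fused decision differs from the short-term system's own decision on both utterances, which is the kind of effect a combination relies on. The supervector length is the number of mixture components times the feature dimension, so every utterance maps to the same input size for a feed-forward network or an SVM regardless of its duration.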

Cited By

[1] Speaker age and gender recognition using 1D and 2D convolutional neural networks, Yücesoy, Ergün, Neural Computing and Applications, ISSN 0941-0643, Issue 6, Volume 36, 2024.
Digital Object Identifier: 10.1007/s00521-023-09153-0

[2] Age Estimation from Speech Using Tuned CNN Model on Edge Devices, Durgam, Laxmi Kantham, Jatoth, Ravi Kumar, Journal of Signal Processing Systems, ISSN 1939-8018, Issue 10, Volume 96, 2024.
Digital Object Identifier: 10.1007/s11265-024-01929-4

[3] Image Retrieval using One-Dimensional Color Histogram Created with Entropy, KILICASLAN, M., TANYERI, U., DEMIRCI, R., Advances in Electrical and Computer Engineering, ISSN 1582-7445, Issue 2, Volume 20, 2020.
Digital Object Identifier: 10.4316/AECE.2020.02010

[4] Age and Gender Estimation Through Speech: A Comparison of Various Techniques, Shabbir, Maliha, Hussain, Amjad, Khan, Maqsood Muhammad, 2023 18th International Conference on Emerging Technologies (ICET), ISBN 979-8-3503-2817-2, 2023.
Digital Object Identifier: 10.1109/ICET59753.2023.10374670

[5] Technology as Infrastructure for Dehumanization:, Oviatt, Sharon, Proceedings of the 2021 International Conference on Multimodal Interaction, ISBN 9781450384810, 2021.
Digital Object Identifier: 10.1145/3462244.3482855

Copyright ©2001-2025
Faculty of Electrical Engineering and Computer Science
Stefan cel Mare University of Suceava, Romania

All rights reserved: Advances in Electrical and Computer Engineering is a registered trademark of the Stefan cel Mare University of Suceava. No part of this publication may be reproduced, stored in a retrieval system, photocopied, recorded or archived, without the written permission from the Editor. When authors submit their papers for publication, they agree that the copyright for their article be transferred to the Faculty of Electrical Engineering and Computer Science, Stefan cel Mare University of Suceava, Romania, if and only if the articles are accepted for publication. The copyright covers the exclusive rights to reproduce and distribute the article, including reprints and translations.
