ARCHITECTURAL APPROACHES TO BUILDING SCALABLE VOICE IDENTIFICATION SYSTEMS AND THEIR IMPACT ON IT PROJECT ECONOMISC
DOI:
https://doi.org/10.31732/2663-2209-2025-80-199-211Keywords:
voice identification, microservice architecture, cloud computing, CAPEX, OPEX, SLA, TCO, ML frameworksAbstract
The article examines architectural approaches to building scalable voice identification systems and their impact on the economics of IT projects in the field of software engineering and project management. The relevance of the topic is driven by the rapid growth of the voice services market and the need for businesses to simultaneously ensure high accuracy of biometric authentication, compliance with SLA requirements for latency and availability, as well as control of the total cost of ownership (TCO). The theoretical foundation of the study includes modern research on voice biometrics systems, microservice and cloud-native architectures, as well as cost management approaches for CAPEX/OPEX in cloud IT solutions. The aim of the article is to develop a conceptual model for correlating architectural decisions (monolithic vs. microservice architecture, on-premise vs. cloud deployment, use of modern ML frameworks) with key economic indicators of an IT project (CAPEX, OPEX, TCO, scaling-related savings, SLA non-compliance risks). The research methodology is based on a systematic analysis of scientific sources, comparative architectural analysis, formalization of economic dependencies, and constructing a generalized cost model for various deployment scenarios of voice identification systems. The obtained results include: a structural model of a typical voice identification system; an analytical comparison of monolithic and microservice architectures in terms of their impact on CAPEX/OPEX; an integrated economic model for evaluating TCO across four deployment options: monolithic on-premise, microservices on-premise, monolithic in the cloud (IaaS), and cloud-native/serverless architecture. It has been shown that transitioning to a microservice cloud-native architecture reduces the share of CAPEX and shifts costs toward a more controllable OPEX structure.
Downloads
References
Amzur. (2024). Comparing CapEx & OpEx cloud models: Cloud cost optimization strategy. Amzur.
AWS. (2021). Optimizing enterprise economics with serverless architectures (AWS Whitepaper). Amazon Web Services.
CloudZero. (2023). CapEx vs. OpEx in the cloud: 10 key differences. CloudZero.
Gill, S. S., Singh, I., Chana, I., Buyya, R., et al. (2024). Modern computing: Vision and challenges. Journal of Modern Computing, 1(1), 1–35. [https://www.sciencedirect.com/science/article/pii/S2772503024000021]
Greenberg, C. S., Mason, L. R., Sadjadi, S. O., & Reynolds, D. A. (2020). Two decades of speaker recognition evaluation at the National Institute of Standards and Technology. Computer Speech & Language, 60, 101032. [https://colab.ws/articles/10.1016%2Fj.csl.2019.101032]
Guha, A. (2025, October 6). Building scalable voice AI: From MVP to enterprise. TringTring.AI. [https://tringtring.ai/blog/technical-deep-dive/building-scalable-voice-ai-from-mvp-to-enterprise]
Lee, C., et al. (2024). A systematic literature review on the strategic shift to cloud ERP leveraging microservice architecture and MSPs for resilience and agility. Electronics, 13(14), 2885. [https://www.mdpi.com/2079-9292/13/14/2885]
Liu, Y., et al. (2019). The THUEE system for NIST 2019 speaker recognition evaluation CTS challenge. arXiv preprint arXiv:1912.11585. [https://arxiv.org/pdf/1912.11585]
Microsoft. (2025). Cost efficiency considerations for your cloud adoption strategy. Microsoft Azure Architecture Center.[ https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/strategy/inform/cost-efficiency]
Someshwara Team. (2025, July 11). Building scalable voice AI systems: Best practices and challenges. Someshwara Tech Blog. [https://www.someshwara.com/blog/building-scalable-voice-ai]
TensorFlow. (n.d.). TensorFlow: An end-to-end open source platform for machine learning. Google Open Source.[ https://opensource.google/projects/tensorflow]
PyTorch Foundation. (n.d.). PyTorch: An open source deep learning framework. PyTorch / Linux Foundation AI. [https://ai.meta.com/tools/pytorch]
Google Cloud. (2020, May 15). Principles of cloud cost optimization. Google Cloud Blog. [https://cloud.google.com/blog/topics/cost-management/principles-of-cloud-cost-optimization]
Milvus. (2023). How can microservices architectures benefit audio search applications? Milvus AI Quick Reference. [https://milvus.io/ai-quick-reference/how-can-microservices-architectures-benefit-audio-search-applications]
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.