Skip to main content

About VoicePing

VoicePing is revolutionizing global business communication by developing the foundational Voice AI Model through cutting-edge technologies. Founded in 2019, we’ve grown to serve 1000+ enterprise customers with our industry-leading speech translation infrastructure. Learn More:

Company Achievements

  • 1000+ paying enterprise customers
  • $2.3M USD total investment as SEED
  • Recognition from global accelerators (500 Global, Rainmaking APAC, AlchemistX)
VoicePing Team Overview

Our diverse team includes elite AI developers from SoftBank, Rakuten, IBM, and more


Technical Infrastructure Overview

Core AI Stack

ComponentTechnology
Speech RecognitionCustom-trained transformers optimized for Asian languages
Translation EngineFine-tuned LLMs with domain-specific training
Text-to-SpeechNatural emotional synthesized audio generation model
Language ModelsSpecialized models for business terminology and technical vocabulary
Audio ProcessingReal-time audio streaming with custom DSP pipelines

Development Stack

Frontend:
  • Core: React, TypeScript, Flutter (iOS, Android)
  • Real-time: WebSocket, WebRTC for audio streaming
  • UI Components: Ant Design, Flutter
Backend:
  • Primary: Node.js (TypeScript), Python (FastAPI), Golang
  • APIs: REST
DevOps & Infrastructure:
  • Cloud Platforms: AWS, GCP, Azure, VPS, OnPremise
  • CI/CD: GitHub Actions, Azure CodePipeline
AI/ML Pipeline:
  • Tools: PyTorch, Hugging Face, Llama, Unsloth, Qwen

Core Responsibilities

Infrastructure Development

  • Build and maintain real-time audio processing pipelines
  • Implement high-performance API endpoints for translation services
  • Develop and optimize WebSocket connections for live translation
  • Create and maintain containerized microservices

AI Integration

  • Collaborate with ML engineers on model deployment
  • Implement model serving infrastructure
  • Optimize inference pipelines for low latency
  • Develop monitoring systems for model performance

AI Model Creation

  • Design data pipeline strategy and collect high-quality data
  • Adjust AI model parameters (hyperparameters, architecture) to improve VRAM efficiency and inference speed
  • Create model evaluation strategy, analyze results, and determine the right direction
  • Research latest OSS models and papers to improve current models
  • Publish research articles and contribute to OSS ecosystem

Application Development

  • Build responsive web interfaces for real-time translation
  • Implement offline translation capabilities
  • Create developer tools and SDKs
  • Develop internal dashboards for system monitoring

Technical Requirements

Must Have

  • 0.5-3 years of experience with React and TypeScript (or capability to create quick frontend prototypes)
  • Some experience with Node.js and Python or similar frameworks
  • Basic understanding of CI/CD principles
  • Experience with cloud platforms (AWS/GCP/Azure)
  • Strong interest in AI technologies and ecosystems

Nice to Have


Current Technical Challenges

  • Optimizing real-time audio processing for low latency
  • E2E model creation for emotional speech translation
  • E2E model creation for AI voice agent bot
  • Scaling ML model inference to minimize inference speed with low energy
  • Implementing efficient offline translation
  • Building robust monitoring systems to support our infrastructures
  • Developing SDK for third-party integrations
  • Large-scale multilingual training data collection and creation

Growth Opportunities

  • Deep dive into ML infrastructure and ecosystem
  • Learn advanced audio processing
  • Machine learning training framework
  • Data pipeline for AI model training data
  • Master distributed systems

Benefits

  • Competitive salary based on experience: 7,000,000 - 12,000,000
  • Flexible work arrangements (hybrid/remote)
  • Health insurance
  • Regular technical workshops

Location

  • Tokyo (primary)
  • Singapore office (expanding) - if you have Singapore residency
  • Remote work options available (Currently 85% remote, 15% office)

How to Apply

Send to [email protected]:
  1. Resume/CV with technical projects
  2. Please clarify how you can contribute and what you want to do
We’re looking for developers passionate about building the future of voice AI and real-time translation technology!