About VoicePing
VoicePing is revolutionizing global business communication by developing the foundational Voice AI Model through cutting-edge technologies. Founded in 2019, we’ve grown to serve 1000+ enterprise customers with our industry-leading speech translation infrastructure. Learn More:- Our Product
- Product Development Journey
- J-StarX Silicon Valley Program
- Insights from 500 Global’s J-StarX Program
- Alchemist Accelerator Participation
- Company Profile
Company Achievements
- 1000+ paying enterprise customers
- $2.3M USD total investment as SEED
- Recognition from global accelerators (500 Global, Rainmaking APAC, AlchemistX)

Our diverse team includes elite AI developers from SoftBank, Rakuten, IBM, and more
Technical Infrastructure Overview
Core AI Stack
| Component | Technology |
|---|---|
| Speech Recognition | Custom-trained transformers optimized for Asian languages |
| Translation Engine | Fine-tuned LLMs with domain-specific training |
| Text-to-Speech | Natural emotional synthesized audio generation model |
| Language Models | Specialized models for business terminology and technical vocabulary |
| Audio Processing | Real-time audio streaming with custom DSP pipelines |
Development Stack
Frontend:- Core: React, TypeScript, Flutter (iOS, Android)
- Real-time: WebSocket, WebRTC for audio streaming
- UI Components: Ant Design, Flutter
- Primary: Node.js (TypeScript), Python (FastAPI), Golang
- APIs: REST
- Cloud Platforms: AWS, GCP, Azure, VPS, OnPremise
- CI/CD: GitHub Actions, Azure CodePipeline
- Tools: PyTorch, Hugging Face, Llama, Unsloth, Qwen
Core Responsibilities
Infrastructure Development
- Build and maintain real-time audio processing pipelines
- Implement high-performance API endpoints for translation services
- Develop and optimize WebSocket connections for live translation
- Create and maintain containerized microservices
AI Integration
- Collaborate with ML engineers on model deployment
- Implement model serving infrastructure
- Optimize inference pipelines for low latency
- Develop monitoring systems for model performance
AI Model Creation
- Design data pipeline strategy and collect high-quality data
- Adjust AI model parameters (hyperparameters, architecture) to improve VRAM efficiency and inference speed
- Create model evaluation strategy, analyze results, and determine the right direction
- Research latest OSS models and papers to improve current models
- Publish research articles and contribute to OSS ecosystem
Application Development
- Build responsive web interfaces for real-time translation
- Implement offline translation capabilities
- Create developer tools and SDKs
- Develop internal dashboards for system monitoring
Technical Requirements
Must Have
- 0.5-3 years of experience with React and TypeScript (or capability to create quick frontend prototypes)
- Some experience with Node.js and Python or similar frameworks
- Basic understanding of CI/CD principles
- Experience with cloud platforms (AWS/GCP/Azure)
- Strong interest in AI technologies and ecosystems
Nice to Have
- Audio and Video Digital processing
- Knowledge of ML model deployment and inference optimization
- See our research: Translation Inference Throughput, Translation Bottleneck
- Background in NLP or speech recognition or translation research
- Understanding of high-performance computing and scalable data processing
- See our research: Go WebSocket Proxy
Current Technical Challenges
- Optimizing real-time audio processing for low latency
- E2E model creation for emotional speech translation
- E2E model creation for AI voice agent bot
- Scaling ML model inference to minimize inference speed with low energy
- Implementing efficient offline translation
- Building robust monitoring systems to support our infrastructures
- Developing SDK for third-party integrations
- Large-scale multilingual training data collection and creation
Growth Opportunities
- Deep dive into ML infrastructure and ecosystem
- Learn advanced audio processing
- Machine learning training framework
- Data pipeline for AI model training data
- Master distributed systems
Benefits
- Competitive salary based on experience: 7,000,000 - 12,000,000
- Flexible work arrangements (hybrid/remote)
- Health insurance
- Regular technical workshops
Location
- Tokyo (primary)
- Singapore office (expanding) - if you have Singapore residency
- Remote work options available (Currently 85% remote, 15% office)
How to Apply
Send to [email protected]:- Resume/CV with technical projects
- Please clarify how you can contribute and what you want to do
We’re looking for developers passionate about building the future of voice AI and real-time translation technology!
