"Science is an error-correcting process." β Charles S. Peirce
I am a Machine Learning Researcher and PhD candidate at Tampere University, specializing in Signal Processing and Machine Learning.
My research focuses on multimodal learning, representation learning, and audio understanding. My work involves developing and optimizing deep learning models for audio classification, sound event detection, multimodal alignment/grounding, and cross-modal information retrieval.
- π§ Machine Learning for Audio Understanding (classification, detection, retrieval, generation)
- π Self-Supervised Representation Learning
- π Multimodal Learning (audio + text + image + video)
- π§© Low-Resource Learning (zero-shot, few-shot)
- π» Programming: Python, Java, JavaScript, SQL, GDScript
- βοΈ Machine Learning: PyTorch, TensorFlow, scikit-learn, Ray Tune, MLflow
- π£οΈ Audio & NLP: librosa, torchaudio, NLTK
- π Data Analysis: NumPy, SciPy, Pandas, Jupyter, Matplotlib
- π Web & Backend: Java EE, Spring, Hibernate, Django, Flask, Gradio
- π± GUI & Game Development: PySide6, Godot Engine
- βοΈ Databases & DevOps: MySQL, PostgreSQL, Linux, Docker, Git
- 𧬠Audio-Text Semantic Alignment using Unsupervised Learning
- π Negative Sampling in Contrastive Learning of Audio-Text Representations
- 𦻠Subjective Evaluation of Audio-Text Semantic Relevance
- β»οΈ Estimating Audio-Text Semantic Relevance through Audio Captions
- π DCASE 2025 Challenge Task 6: Language-Based Audio Retrieval
- π DCASE 2024 Challenge Task 8: Language-Based Audio Retrieval
- π DCASE 2023 Challenge Task 6: Automated Audio Captioning and Language-Based Audio Retrieval
- π DCASE 2022 Challenge Task 6: Automated Audio Captioning and Language-Based Audio Retrieval
Happy to discuss multimodal ML, applied AI, or the challenges of building scalable AI systems. Whether you're hacking on a side project, exploring new ideas, or working in research β feel free to reach out, I'd love to exchange thoughts!
π« Email: huang.xie@outlook.com
π Google Scholar: scholar.google.com/citations?user=_wmP81AAAAAJ
π LinkedIn: linkedin.com/in/huang-xie-28b7872bb