👋 Hi, I'm Huang Xie (谢晃)

"Science is an error-correcting process." — Charles S. Peirce

🎓 About Me

I am a Machine Learning Researcher and PhD candidate at Tampere University, specializing in Signal Processing and Machine Learning.

My research focuses on multimodal learning, representation learning, and audio understanding. I develop and optimize deep learning models for audio classification, sound event detection, multimodal alignment/grounding, and cross-modal information retrieval.

🧠 Research Interests

🎧 Machine Learning for Audio Understanding (classification, detection, retrieval, generation)
🔍 Self-Supervised Representation Learning
🔄 Multimodal Learning (audio + text + image + video)
🧩 Low-Resource Learning (zero-shot, few-shot)

🛠️ Tech Stack

💻 Programming: Python, Java, JavaScript, SQL, GDScript
⚛️ Machine Learning: PyTorch, TensorFlow, scikit-learn, Ray Tune, MLflow
🗣️ Audio & NLP: librosa, torchaudio, NLTK
📊 Data Analysis: NumPy, SciPy, Pandas, Jupyter, Matplotlib
🌐 Web & Backend: Java EE, Spring, Hibernate, Django, Flask, Gradio
📱 GUI & Game Development: PySide6, Godot Engine
⚙️ Databases & DevOps: MySQL, PostgreSQL, Linux, Docker, Git

🧪 Featured Projects

💬 Let's Connect

I'm happy to discuss multimodal ML, applied AI, or the challenges of building scalable AI systems. Whether you're hacking on a side project, exploring new ideas, or working in research, feel free to reach out. I'd love to exchange thoughts!

📫 Email: huang.xie@outlook.com
🔗 Google Scholar: scholar.google.com/citations?user=_wmP81AAAAAJ
🔗 LinkedIn: linkedin.com/in/huang-xie-28b7872bb
