Mayumi Ohta, M.A.
I am a PhD student under the supervision of Prof. Dr. Stefan Riezler and a member of the Statistical NLP Group.
News
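- July 2022: I attended the Lisbon Machine Learning School (LxMLS) 2022.
- June 2022: My contributed article "Learning Neural Machine Translation with Python+PyTorch and JoeyNMT" (「Python+PyTorch」と「JoeyNMT」で学ぶニューラル機械翻訳) has been published on ITmedia @IT.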
- June 2022: Speech-to-Text modules on JoeyNMT are now available! Please check out this walk-through tutorial.
Research Interests
- Personalized Machine Learning
- Differential Privacy
- Machine Translation
- Speech Recognition
Curriculum Vitae
Education
2022- | PhD Student, Computational Linguistics |
Heidelberg University, Germany | |
2017 | Master of Arts, Computational Linguistics |
Heidelberg University, Germany | |
2014 | Bachelor of Arts, Computational Linguistics |
Heidelberg University, Germany | |
2011 | Research Student, Information Studies |
University of Tokyo, Japan |
Employment
08/2022 – | Research Intern (part time) |
Fraunhofer Institute for Systems and Innovation Research, Karlsruhe, Germany | |
Data Science | |
08/2021 – 07/2022 | NLP Intern |
Yaraku Inc., Tokyo, Japan | |
Machine Translation | |
05/2019 – 07/2021 | Research Assistant |
Institute for Computational Linguistics, Heidelberg University, Germany | |
10/2019 – 12/2019 | Research Intern |
NEC Labs Europe, Heidelberg, Germany | |
Information Retrieval, Slot Filling | |
01/2018 – 04/2019 | Research Scientist (full time) |
Nuance Communications Inc., Aachen, Germany | |
Language modeling | |
11/2016 – 06/2017 | Working Student |
Leibniz ScienceCampus, Heidelberg, Germany [ LiMo ] | |
Semantic role linking | |
08/2013 – 10/2016 | Working Student |
IT department at the Centre for East Asian Studies, Heidelberg, Germany [ ZO ] | |
Web administration and application development | |
08/2014 – 02/2015 | Software Developer (full time) |
Publications Office at the Karl Jaspers Centre, Heidelberg, Germany [ HeiUp ] | |
Digital publishing | |
03/2013 | Intern |
Department of Scalability and Performance, SAP Inc., Walldorf, Germany | |
12/2007 – 03/2009 | Working Student |
Educational Planning Office at University of Tokyo, Tokyo, Japan | |
Video editing, MOOC content development | |
04/2007 – 03/2009 | Trainee |
Suzuki Audio GmbH, Tokyo, Japan | |
Video editing, broadcast operations |
Teaching
2022/23 Winter | Statistical Methods for Computational Linguistics (Teaching Assistant) |
2022 Summer | Neural Machine Translation in Practice (AIMS Senegal) |
2021 Summer | Mathematical Foundations of Computational Linguistics (Lecturer) |
2020/21 Winter | Generalization in Deep Learning (Lecturer) |
2020 Summer | Mathematical Foundations of Computational Linguistics (Lecturer) |
2017 Summer | Interactive Neural Machine Translation (Tutorial for IUED Students) |
Publications
- JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations, Abu Dhabi, 2022
@inproceedings{joeys2t2022, author = {Ohta, Mayumi and Kreutzer, Julia and Riezler, Stefan}, title = {Joey{S2T}: Minimalistic Speech-to-Text Modeling with {JoeyNMT}}, booktitle = {Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing {(EMNLP)}: System Demonstrations}, year = {2022}, address = {Abu Dhabi}, url = {https://arxiv.org/abs/2210.02545} }
- On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR. Proceedings of the 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH), Brno, Czech Republic, 2021
@inproceedings{lam2021, author = {Lam, Tsz Kin and Ohta, Mayumi and Schamoni, Shigehiko and Riezler, Stefan}, title = {On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR}, booktitle = {Proceedings of the 22nd Annual Conference of the International Speech Communication Association}, journal-abbrev = {INTERSPEECH}, year = {2021}, address = {Brno, Czech Republic}, url = {https://arxiv.org/abs/2104.01393} }
- Sparse Perturbations for Improved Convergence in Stochastic Zeroth-Order Optimization. The 6th International Conference on Machine Learning, Optimization, and Data Science (LOD), Siena, Italy, 2020
@inproceedings{ohta2020, author = {Ohta, Mayumi and Berger, Nathaniel and Sokolov, Artem and Riezler, Stefan}, year = {2020}, title = {Sparse Perturbations for Improved Convergence in Stochastic Zeroth-Order Optimization}, booktitle = {The 6th International Conference on Machine Learning, Optimization, and Data Science}, journal-abbrev = {LOD}, address = {Siena, Italy}, url = {https://arxiv.org/abs/2006.01759} }
- Sparse Stochastic Zeroth-Order Optimization with an Application to Bandit Structured Prediction. arXiv preprint arXiv:1806.04458, 2018 (Preprint)
@article{sokolov2018, author = {Sokolov, Artem and Hitschler, Julian and Ohta, Mayumi and Riezler, Stefan}, title = {Sparse Stochastic Zeroth-Order Optimization with an Application to Bandit Structured Prediction}, journal = {arXiv preprint arXiv:1806.04458}, note = {Preprint}, year = {2018}, url = {https://arxiv.org/abs/1806.04458} }
- Otedama: Fast Rule-based Pre-Ordering for Machine Translation. The Prague Bulletin of Mathematical Linguistics (PBML), 106, 159–168, 2016
@article{hitschler2016b, author = {Hitschler, Julian and Jehl, Laura and Karimova, Sariya and Ohta, Mayumi and K\"{o}rner, Benjamin and Riezler, Stefan}, title = {Otedama: Fast Rule-based Pre-Ordering for Machine Translation}, journal = {The Prague Bulletin of Mathematical Linguistics}, journal-abbrev = {PBML}, number = {106}, pages = {159--168}, year = {2016}, url = {http://www.cl.uni-heidelberg.de/~riezler/publications/papers/MTM2016.pdf} }
Invited Talks
- JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT. Masakhane Colloquium, 2021
@misc{ohta:joeys2t:2021, author = {Ohta, Mayumi}, howpublished = {Masakhane Colloquium}, title = {JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT}, year = {2021} }
Articles (in Japanese)
- 「Python+PyTorch」と「JoeyNMT」で学ぶニューラル機械翻訳 (Learning Neural Machine Translation with "Python+PyTorch" and "JoeyNMT"). ITmedia @IT, June 29, 2022
@article{ohta:joeynmt2.0:2022, author = {Ohta, Mayumi}, title = {「Python+PyTorch」と「JoeyNMT」で学ぶニューラル機械翻訳}, journal = {ITmedia @IT}, type = {Article}, number = {June 29}, year = {2022}, url = {https://atmarkit.itmedia.co.jp/ait/articles/2206/29/news008.html} }