Mahtab Farrokh


Machine Learning Engineer

I am a Machine Learning Resident at Amii, focusing on utilizing OCR, RAG, and LLMs to streamline legal processes for Clio-Cloud-Based Legal Technology. I am passionate about creating impactful solutions, especially in the healthcare sector, and I am always keen to learn and grow in areas that merge ML and real-world applications. I am currently looking for my next job opportunity in machine learning.

Interests: Machine Learning, Natural Language Processing (NLP), Large Language Models (LLMs), Applied ML in Healthcare, Computer Vision and Medical Imaging

Click here to download my CV!



Education

University of Alberta

Master of Computer Science
Computer Science Dept. under supervision of Dr. Russell Greiner

  • Introduced a novel pipeline for prostate cancer (PCa) recurrence prediction using tissue images, by utilizing contrastive self‑supervised learning improved PCa prediction by 7%, resulting in a publication in the Journal of Pathology Informatics.
  • Proposed a novel and effective evaluation metric for individualized survival probability prediction published in ICML.
2021 - 2023

Amirkabir University of Technology

Bachelor of Computer Engineering
Computer Engineering Dept. - Software Engineering

  • Designed a graph‑based multi‑label classification algorithm to improve a symptom checker’s diagnosis accuracy by 9%.
2015 - 2020

Interests

My interests are as follows:

  • Machine Learning
  • Applied ML in Healthcare
  • Medical Imaging
  • Natural Language Processing
  • Computer vision


Based on my enthusiasm for machine learning subjects, I successfully completed the following courses on Coursera:


Skills

Languages: Python3, C/C++, Node.js, Matlab, Go
Frameworks and Tools: Langchain, LLamaIndex, PyTorch, Keras, CUDA, Numpy, Scikit-learn, HuggingFace, OpenCV, Pandas, JAX, Jupyter notebook
Databases: MySQL, MongoDB, PostgreSQL, Redis
Web Programming: React, Express with Node.JS, HTML5, CSS, JavaScript, Django with Python
Operating Systems: Ubuntu, MacOS, Windows
Typesetting Tools: Vim, Latex, Microsoft office(Word, Powerpoint, Excel), Prezi

Work Experiance

Machine Learning Engineer

Alberta Machine Intelligence Institute (Amii) and Clio Cloud-Based Legal Technology
  • Led end‑to‑end development of an LLM pipeline leveraging GPT‑4 and Langchain to automate legal document analysis, including timeline summarization, entity recognition, and date extraction with 83% accuracy.
  • Developed Docker‑based deployment pipeline with FastAPI, successfully transitioning to the production environment.
  • Developed strong client communication and decision-making skills through leading weekly client meetings.
  • Tools: Langchain, OpenAI, Docker, Transformers, FastAPI, Python, MLflow
  • Jul 2024 - Now

    Machine Learning Intern

    Alberta Machine Intelligence Institute (AMII) and MDA Space
  • Developed and optimized a Retrieval‑Augmented Generation (RAG) chatbot for answering engineering questions.
  • Performed prompt engineering on the RAG chatbot, utilizing hybrid search methods and reranking models, to enhance information retrieval accuracy by 20%.
  • Tools: Langchain, HuggingFace, Open‑Source LLMs, Chroma Vector DB, Milvus, Python
  • Mar 2024 - Jun 2024

    Artificial Intelligence Developer

    Tebinja
  • Designed and implemented a web‑based symptom checker using ensemble models, including DQN (Deep Q‑network), Deep Neural Network, achieving 74% accuracy.
  • Tools: Pytorch, Scikit‑Learn, Python, Node.js, React, CSS.
  • Oct 2017 - May 2019


    Publications

  • Exploring Language‑Agnostic Speech Representations Using Domain Knowledge for Detecting Alzheimer’s Dementia
  •   Accepted at 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing - [paper link]

  • An Effective Meaningful Way to Evaluate Survival Models
  •   Accepted at 2023 ICML - [paper link]

  • Learning to Predict Prostate Cancer Recurrence from Tissue Images
  •   Accepted at Journal of Pathology Informatics 2023 - [paper link]


    Teaching Experiance

    Mila ‑ AI4Good Lab

    Teacher Assistant (2023) and Mentor (2024)
    • Designed lectures and served as a mentor for a training program for women and gender‑diverse people across Canada.
    • Led interactive discussions to ensure students’ understanding of Neural Networks, Convolutional Neural Networks, Residual Neural Networks, Transformers, and more.
    2023 - 2024

    University of Alberta

    Teacher Assistant
    • Computer Vision, Instructor: Dr. Martin Jagersand
    • Intelligent System, Instructor: Dr. James Wright
    • Introduction to the Foundations of Computation II, Instructor: Dr. Megan Flanders
    2021 - 2022

    Amirkabir University of Technology

    Teacher Assistant
    • Signals & Systems, Instructor: Dr. Mehdi Rasti
    • Artificial Intelligence & Expert SystemsSystems, Instructor: Dr. Ahmad Nikabadi
    • Engineering Statistics, Instructor: Dr. Maryam Amir Haeri
    • Data Structure & Algorithms, Instructor: Dr. Mehdi Dehghan
    • Logic Circuits, Instructors: Dr. Sedighi, Dr. Sabaei and Dr. Saheb Zamani
    • Introduction to Programming, Instructor: Dr. Bahador Bakhshi
     
    Teacher
    • Introduction to Python3 programming
    2016 - 2020

    Extracurricular Activity

    Great Cycle Challange

    Volunteer

    Biked more than 600 Km, and raised more than $1.3k to support kids with cancer. [My fund raising page]

    2021 - 2024

    SSC

    Member

    Elected as a member of Students Scientific Chapter(SSC), CEIT Department, Amirkabir University of Technology

    2017 - 2018

    Linux Festival

    Member of Execution Committee

    9th and 8th National AUT Linux Festival

    Spring 2016, 2017

    ACM ICPC

    Member of Execution Committee

    18nd, 17nd, and 16nd International AUT ACM ICPC.

    Fall 2016, 2017, 2018

    DMC

    Member of Execution Committee

    1st Data Mining Cup

    Fall 2017