Summary of Qualifications

  • Languages: Java, C++, and Python.

  • Data processing and storage: data processing frameworks based on MapReduce, distributed data replication and partitioning, and batch and streaming processing.

  • ML skills: data visualization, predictive analysis, statistical modeling, and deep learning.

Professional Experience

Software Engineer, Google Inc. (2019.11 – Present)

  • Google Fi network connectivity improvement using data-driven approaches.

Senior Research Engineer, Samsung Research America. (2017.05 – 2019.11)

  • Samsung digital health innovation platform.

    • Designed and built ETL pipelines for Samsung health platforms.

    • Mentored interns and junior developers in server-side development and machine learning projects.

    • Techniques: Spark, HBase, HDFS, Phoenix, Java, Scala, Python, SQL, shell script, AWS (EC2, ECR, EKS, EMR, S3, DynamoDB, Aurora, etc.), and machine learning.

Software Engineering Intern, Samsung Research America. (2017.01 – 2017.04)

  • Ads recommendation for Samsung Pay.

    • Proposed, designed, and developed the first machine learning ads recommendation in Samsung Pay.

    • Evaluated multiple machine learning approaches and improved the conversion rate by 18% in production.

    • Techniques: Java, Python, Redshift, SQL, shell script and machine learning.

Web Developer, University of Arkansas for Medical Sciences (2009.01 – 2010.06)

  • Clinical trial systems and and data analysis.

    • Contributed to the development of clinical trial systems through software lifecycles, from requirements definition to production deployment.

    • Developed a report generation portal to support doctors’ decisions based on patients’ data.

    • Techniques: Java, Spring frameworks, MySQL, shell script, and JavaScript.

Education

B.S. in Information Management, Huazhong Agricultural University (2007)

Awards

  • SRA Award for Exceptional Achievement and Outstanding Contributions beyond Normal Job Responsibilities, Samsung Research America 2019

Publication

  • Qinghan Xue, Xiaoran Wang, Samuel Meehan, Jilong Kuang, Jun Gao, and Mooi Choo Chuah. “Recurrent Neural Networks based Obesity Status Prediction Using Activity Data,” in Proceedings of 17th IEEE International Conference on Machine Learning Applications, 2018

  • Xiaoran Wang, Leonardo Jimenez Rodriguez, and Jilong Kuang. “Modeling Topics on Open Source Apache Spark Repositories,” in Proceedings of 18th International Conference on Data Science, 2018

  • Leonardo Jimenez Rodriguez, Xiaoran Wang, and Jilong Kuang. “Insights on Apache Spark usage by mining Stack Overflow questions,” in Proceedings of 2018 IEEE International Congress on Big Data, 2018

  • Xiaoran Wang, Lori Pollock, and K Vijay-Shanker. “Automatically Generating Natural Language Descriptions for Object-related Statement Sequences,” in Proceedings of 24th IEEE International Conference on Software Analysis, Evolution, and Reengineering, 2017

  • Xiaoran Wang, Lori Pollock, and K Vijay-Shanker. “Developing a Model of Loop Actions by Mining Loop Characteristics from a Large Code Corpus,” in Proceedings of 31st IEEE International Conference on Software Maintenance and Evolution, 2015

  • Xiaoran Wang, Lori Pollock, and K Vijay-Shanker. “Automatic Segmentation of Method into Meaningful Blocks to Improve Readability,” in Proceedings of 18th IEEE Working Conference on Reverse Engineering, 2011