DataCATz is a collaborative project run by a group of friends with overlapping intellectual and professional interests. We started this blog with two aims: to provide a reference for problems and possible solutions in our domain of interest, and to increase our visibility in the wider data analyst community. The contributing members come from different formal training backgrounds, e.g. Medicine, Biological Sciences, Engineering, Physics and Chemistry, and have over time converged towards analytics. Please feel free to browse the topics and leave helpful comments.

Rossella Melchiotti
Senior Bioinformatician/Data Scientist. After a master's degree in computer engineering from Politecnico di Milano (Italy) and a master's degree in general engineering from Ecole Centrale de Lyon (France), both achieved under the T.I.M.E. (Top Industrial Managers for Europe) program, I followed my passion for human healthcare and obtained a PhD in translational medicine from the Universita’ di Milano Bocconi (Italy) in collaboration with the Singapore Immunology Network.

My interests are varied and encompass multiple domains: from Python/R coding to databases, from multivariate linear regression and unsupervised multidimensional clustering methods to immunology and healthcare. In my current position I mainly apply my statistical and computational skills to the analysis of transcriptomics (microarrays, RNA-Seq) and single-cell proteomics (CyTOF) datasets. I also have a strong interest in soft skills development and, in particular, in leadership models and strategies to improve productivity.

Bitbucket Repository | LinkedIn Profile | ResearchGate Profile

Umar Niazi


I am a data scientist with an MSc (with distinction) from the University of Leicester and a PhD from the University of Manchester, with extensive experience in modelling big data using machine learning, Bayesian statistical and graph/network-based methods for estimation, decision making, clustering and dimension reduction. My interests include: Bayesian statistics; graphs and networks; statistical and machine learning; clustering and dimension reduction; Markov Chain Monte Carlo (MCMC) simulations; biomarker discovery; bioinformatics and cheminformatics; longitudinal case-control studies; and Next Generation Sequencing (NGS) data analysis.

GitHub Repository | LinkedIn Profile | ResearchGate Profile

Sanjana Sood


I am a London-based data analyst. I trained in Bioinformatics and have three years of hands-on experience in a health care diagnostics company. I’m into anything intriguing and interesting. A passion of mine is visualising information for ease of perception and making sense of “intangible” things like numbers, statistics, ideas and emotions. Through this blog, I intend to share what I have learned, along with tips and tricks for working with different types of data, in a simple manner.


The information contained in this website is for general information purposes only. Any reliance you place on such information is therefore strictly at your own risk.

In no event will we be liable for any loss or damage including without limitation, indirect or consequential loss or damage, or any loss or damage whatsoever arising from loss of data or profits arising out of, or in connection with, the use of this website.


Copyright (C) 2017 DataCATz
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License, Version 1.3
or any later version published by the Free Software Foundation;
with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts.
A copy of the license can be seen at <https://www.gnu.org/licenses/fdl-1.3.html>.

