Resume

Robin Varghese

Whatever the problem, I want to be part of the solution. I enjoy developing software and empowering others in my knowledge space. I bring 10+ years of full-stack development experience and 5+ years of leadership experience.

I thrive in uncertain environments that require constant learning.

Reach Me

Github LinkedIn Email Say Hello

Knowledge space

Tech stack

Interests Languages Tools Databases Platforms
Full-stack Development Python Jupyter BigQuery Google Cloud Platform
Data Science, Analytics SQL django AWS Athena docker
Machine learning pandas plotly mySQL Airflow
Generative AI React.js, bootstrap MongoDB Amazon Web Services
Dev team building Javascript tornado, redis
HTML scikit-learn postgreSQL

Experience

  • 16+ years of professional experience developing software, analyzing data and solving business problems with technology.

  • 9+ years of leadership experience focused on driving innovation, building and mentoring development teams.

Industry Dive

Senior Software Engineer II

Washington D.C · Mar 2021 – Present · 3+ years

2024

(GCP, docker, python, django, react, BigQuery, jupyter, airflow)

  • Discovered, scoped, designed, implemented and launched several nebulous projects (ex. GA4 migration, Recommendation Engine, caching layer).
  • Architected solutions that keep many potential futures open without knowing exactly what the future holds (ex. Ad Spot Fallback, Data Governance).
  • Set long-term technical direction to keep our company ahead of the curve (ex. leveraging our CDP to better serve our readers).

Senior Software Engineer I

Washington D.C · Mar 2021 – Mar 2023 · 2 years

2023

(GCP, docker, python, django, BigQuery, jupyter, airflow)

  • Showcased my engineering leadership skills by guiding a team of Python developers in creating a new product using React.js (Next.js).
  • Broke down complex problems into potential solutions, knowns, and unknowns in order to reach solid resolutions faster (ex. partner data pipeline setup).
  • Architected and shipped, from scratch, the large services required to bootstrap our data engineering infrastructure (dev airflow and jupyter docker containers, version control repos, CI/CD pipelines, custom airflow DAG operators, qa/prod Google Composer airflow instances).
  • Consolidated data from 10+ data sources into our enterprise datalake via airflow DAGs.
  • Developed prototypes to establish the company's best-practice patterns for pipelines in and out of the datalake. Also mentored and trained team members to develop data pipelines using these patterns.
  • Recognized as a prolific contributor to core and side projects (ex. design component editor).
  • Consistently acted as a force multiplier by reducing complexity, guiding technical discussions to consensus and aligning the team around Objectives and Key Results (ex. content gating).

2022

(GCP, python, django, BigQuery, React.js, jupyter, airflow)

  • Trusted mentor and collaborator who fosters a culture of innovation and continuous learning (ex. load testing, cloudflare automation, tech show-and-tell, reducing complexity).
  • Identified and solved important problems on cross-cutting technical features (ex. Trackers, Newsletter churn analysis, gitlab CI/CD pipeline stats etc.).
  • Instilled a test-driven development culture for internal data pipelines, products and platforms.
  • Researched, proposed and delivered new technologies (ex. enterprise datalake, Recommendation Engine, Airflow, Design Component System etc.).
  • Launched new publications (cstoredive.com, proformative.com etc.). Also created comprehensive documentation to streamline the launch process.
  • Authored several internal guidance documents to capture POCs, incidents and getting started guides.

2021

(GCP, docker, python, django, cloudflare)

  • Facilitated smooth deployments for high-impact rollouts (ex. Rocket Loader, Proformative.com) using feature flags and a RACI matrix.
  • Developed a system to visualize and trend website performance using lighthouse, pandas and plotly.
  • Optimized website performance based on Google Core Web Vitals and lighthouse recommendations.
  • Built search capabilities using django haystack and websolr. Performed indexing analysis.
  • Provided thorough and thoughtful code reviews for other engineers.
  • Collaborated with our scrum teams to design solutions for stakeholder feature requests.

Arnot Health

Manager, Business Analytics - Innovation

Elmira, New York · Apr 2015 – Mar 2021 · 6 years

2020

(AWS, SQL, python, jupyter, pandas, plotly, scikit-learn etc.)

  • Served as the Principal Software Developer in the Information Services (IS) department.
  • Performed rigorous analyses on large, complex data sets for strategic initiatives (marketing, grant proposals etc.).
  • Provided strategic insights, hypotheses, and conclusions based upon findings to leaders at all levels using ipython notebooks, sql, pandas and plotly.
  • Developed an analytics team that excels at problem framing and problem solving and can change direction quickly.
  • Designed and implemented patterns of best practices for scalable, CI/CD automated, highly performant data platforms and other relevant tech stacks.
  • Developed an enterprise-wide application for self-screening of COVID symptoms. The platform was designed to be highly scalable and fully serverless, harnessing AWS services (lambda functions, dynamodb, simple email service, etc.).
  • Worked on several project roadmaps and served as a trusted committer for internal development code.

2019

(AWS, SQL, Tableau, python, scikit-learn, jupyter, pandas, d3.js etc.)

  • Led the team developing our next generation prescriptive analytics, leveraging our data lake to support chronic disease prevention and management.
  • Automated and maintained business visualizations/dashboards/KPIs.
  • Guided the team in enhancing our data visualization solutions using jupyter, bokeh, HoloViews and PyViz.
  • Conducted several market research analyses using the Medicare claims and NY SPARCS datasets.
  • Led several projects with AWS to ensure HIPAA compliance and a well-architected environment (ex. Athena, s3, QuickSight, IAM etc.).
  • Guided the department in establishing an AWS environment to offload storage and computation needs.
  • Developed a generic event logger to capture events of interest from anywhere in our tech stack.
  • Developed data pipelines to ingest GBs of data daily from legacy systems into the EDW.
  • Developed scripts to aggregate and benchmark physicians on over 150 KPIs for Ongoing Physician Practice Evaluation.

2018

(Hadoop, SQL, Tableau, python, scikit-learn, django, coffeescript, d3.js etc.)

  • Led the team through the pilot and deployment phases of an in-house Hadoop data lake.
  • Led the Business Analytics team through projects, with full responsibility for team management.
  • Mentored senior and junior developers, helped them prioritize their work, gave them actionable feedback and made sure they grew.
  • Provided coaching and direction to the analytics team on best practices and approaches for software development, statistics and machine learning techniques.
  • Surfaced information from across our health care system to support data-driven decision making at all levels of the Arnot Health system.
  • Communicated findings from exploratory and predictive data analysis broadly to Arnot Health leadership.
  • Performed market research and presented quantitative analyses of health-care claim databases.
  • Developed over 100 dashboards using tableau.
  • Built and deployed numerous production servers using pip, virtualenv and other package managers.

Sr. Integration and Database Analyst, Business Analytics

2017

(python, coffeescript, angular, d3.js, django, redis)

  • Led development of our next generation analytics platform, capable of assimilating and visualizing disparate data.
  • Managed data as an enterprise asset, reducing the time to find the right data/report and ensuring data is trustworthy.
  • Architected and developed a scalable analytics platform that helps end users locate, collaborate on and share trustworthy insights in a timely fashion.
  • Performed market research and presented quantitative analyses of healthcare claims and external databases for historical analysis and trend forecasting.
  • Researched and delivered Proof of Concept (POC) implementations that explain key technologies.
  • Transitioned Pilot/POC applications to DevOps team for ongoing development and lifecycle management.
  • Developed over 20 applications to collect data via django webforms.
  • Developed automated UI tests and UI automation jobs using Selenium.
  • Developed ML algorithms to project KPIs (ex. revenue, volumes).
  • Developed a unified API to pull KPIs for various enterprise entities (ex. facility, serviceline, clinic, provider etc.) using the python tornado web server.
  • Analyzed and implemented intelligent caching to reduce application load times and run time for data jobs.
  • Developed a web application that pulls data from the above API to render dashboards and other data visualizations using angular.js, coffeescript, d3.js and other javascript libraries.
  • Configured and maintained nginx servers to act as an SSL endpoint and load balancer and to serve web applications.
  • Configured and applied linux command line tools (tmux, cron, systemd, bash etc.) to maintain prod servers.
  • Developed and implemented an authentication module to control access to API endpoints using django.

2016

(SQL, Tableau, python, scikit-learn, django)

  • Developed applications to assist health care providers in advancing our Population Health Initiatives.
  • Built and deployed machine learning models to predict patient readmissions using scikit-learn, pandas, numpy etc.
  • Developed near real-time actionable notifications for our care-coordination team.
  • Created several tableau dashboards to surface key information for decision makers at all levels of the organization.
  • Worked with fellow developers using agile development practices, continually improving development methods with the goal of automating the build, integration, deployment and monitoring of jobs and Machine Learning (ML) pipelines.
  • Designed and implemented generic parallelized data integration tools to handle ETL jobs.
  • Deployed ML tools and encouraged their adoption across the company.
  • Integrated and maintained over 85 data pipelines from EMR systems, external data sources, flat files etc.

2015

Business Analytics (SQL, Tableau, SSIS)

  • Led the business analytics team in developing an in-house EDW.
  • Mentored the team to incorporate best practices for software development (git version control, testing, automation etc.).
  • Architected the data pipelines and underlying processes to integrate data from all major business units into the EDW.
  • Improved EDW architecture and performance.
  • Worked with data source domain experts, who understand the value potential of their data, to harvest, land and prepare that data at scale.
  • Leveraged my technical expertise to architect and implement solutions to critical business analytics problems (ex. Orthopedic Serviceline dashboard).

Media Mentions

Arnot Ogden Medical Center is Reducing Readmissions

When a patient with four or more admissions is in the ER, a real-time alert activates and the Action Team of emergency department and outpatient case managers, community-based organizations, and physicians is mobilized…

Arnot Health Uses Predictive Analytics to Advance Care Coordination

Care coordination is essential to improving patient satisfaction and healthcare outcomes. It’s at the core of a strategic initiative that Arnot Health implemented to reduce readmissions and frequent emergency department visits at its three hospitals and 52 outpatient clinics across 55 miles…


THIRSTIE

Software Consultant

New York, New York · May 2013 – Oct 2018 · 5+ years

  • Employee #5 in this 7+ year old start-up.
  • Deployed several core services, including credit card transaction processing with Braintree and package tracking with Glympse.
  • Developed tools and automation scripts to onboard new licensed retail partners and sync their inventories.
  • R&D for inventory management, business intelligence and software development in general.

NaviSite

Software Developer, Cloud R&D

Syracuse, New York Area · Aug 2011 – Apr 2015 · 3+ years

2014

  • NaviCloud Director (coffeescript, python, nodeJS, Selenium, RabbitMQ, Mongo DB, vmware vCloud Director etc.)
  • Worked in the scrum team developing our flagship Infrastructure as a Service (IaaS) product, with a shared responsibility to deliver a next generation product.
  • Designed and developed a caching layer, using a node.js worker, to significantly reduce initial app load times.
  • Implemented a search mechanism so that users could quickly narrow down to cloud assets (vm, networking, data-center etc.).
  • R&D for Continuous Integration and automated tests with Selenium to suit our product.
  • REST API development in Python on a Tornado Web server framework with a mongoDB datastore.
  • Web app development using coffeescript, twitter bootstrap, spineJS as our MVC framework, grunt js as our build tool.
  • Developed ansible scripts to automate frequent prod and dev tasks (ex. stock deployment, app stack updates etc.)

2013

  • Near realtime stream processing and analytics(clojure, python, mongoDB, HAProxy, aleph, netty, RabbitMQ)
  • Developed a syslog event stream processing system to comply with SAS70 audit requirements.
  • Harnessed features of load-balancers to achieve scalable and fault tolerant architecture.
  • Developed a scalable layer in clojure, using the netty framework, to receive the data stream and push it to a queue.
  • Used RabbitMQ message queues to stream events into distributed storm processing nodes.
  • Orchestrated mongoDB clusters to run map-reduce jobs and deliver near real-time reports via a webnoir API server.
  • Developed automated fabric scripts to deploy, monitor and control nodes/layers in the application stack.

2012

  • Cloud Services Platform IaaS (Linux Apache MySQL Python, git, javascript, Java, vmware vSphere etc.)
  • Researched and incorporated various features into our application (ex. 2 factor auth, automation scripts for customer provisioning etc.)
  • Researched and developed several POCs in order to investigate new virtualization technologies.

The NaviCloud® Platform | NaviSite

The NaviCloud® platform sets the standard for enterprise-class infrastructure and application performance. This robust, virtualized infrastructure is deployed as multiple, secure infrastructure clouds in NaviSite’s data centers, serving as the foundation for all of NaviSite’s infrastructure, hardware, and application service offerings.


State University of New York

Research Assistant at United Health Services (UHS)

Binghamton NY · May 2009 – Aug 2011 · 2+ years

2011

  • Worked with a clinical team to understand the various facets and causes of readmissions, then developed a research-based probabilistic scoring model (LACE tool) to project a patient’s readmission likelihood.
  • UHS was awarded the “Siemens 2011 Inspired Healthcare Outcomes Challenge” for the LACE tool.
  • Integrated more than 7 systems into the Enterprise Data Warehouse.
  • Assisted the financial division in analyzing P&L, formulating budgets and projecting reimbursement.

2010

  • Internship at Katalytik Inc
  • Android App development (Android(Client), JSON(web services) & Spring, hibernate, mongoDB)
  • Co-designed and developed our core web services, desktop and mobile app for Clinical Physician Order Entry(CPOE).
  • Designed & developed statistical, data mining models in SQL Server Analysis Services.
  • Built OLAP Cubes in Business Intelligence Development Studio to support UHS in making decisions.
  • Created and maintained Tables/Views, SQL stored procedures/queries/codes and executive dashboards.
  • Provided DSS/Crystal Reports/OLAP Cubes training to Analysts, Super users & Department members.
  • Developed and scheduled SQL Server Integration Services packages to update analytical database servers.
  • Demonstrated ability to investigate, analyze information and to draw conclusions.

Media Mention

Siemens Names 2011 ‘Most Inspired’ Healthcare Providers

United Health Services used Siemens Decision Support Solutions (DSS) to help reduce hospital readmissions from 9 percent in 2009 to 6 percent in 2011. DSS tools, such as stored procedures and integration services, were used to calculate a LACE (Length of stay, Acuity, Co-morbid conditions, previous Emergency department visits) score for each patient. Scores were then compiled in reports which helped to focus the attention of nursing unit care managers on the patients at highest risk. This helped remind care managers to provide education, post-discharge instructions and medication management instructions…

Education

Fall 2016

Harvard University

Big Data in Healthcare Applications CSCI E-87 Grade A

Fall 2008

State University of New York at Binghamton 2008 - 2011

Master of Science, Computer Science GPA: 3.7

Expert Mining - Master project

Summer 2003

Mumbai University 2003 - 2007

Bachelor of Engineering, Information Technology Pillai’s Institute of Information Technology - New Panvel

Project link: https://github.com/codein/codein.github.io/blob/master/files/resume.md

Built with Hugo and other OSS