About nference
At nference, the “Google of Biomedicine,” we are making the world’s biomedical knowledge computable to power quantum leaps in human health. By partnering with elite medical institutions, we transform massive, unstructured datasets-including clinical notes, lab results, and medical images-into a structured, research-ready ecosystem. Leveraging state-of-the-art AI and high-performance computing, we bridge the gap between clinical phenotypes and molecular genotypes to accelerate the discovery of next-generation diagnostics and personalized treatments.
Requirements
-
Pursuing a Bachelor's or Master's degree in Computer Science, Information Technology, Data Science, Statistics, Mathematics, Engineering, or a related quantitative field, with an expected graduation year of 2026 or 2027.
-
Strong understanding of SQL for querying, joining, filtering, and aggregating data across multiple tables.
-
Hands-on experience with Python (especially pandas and NumPy) through coursework, internships, or academic/personal projects.
-
Good understanding of descriptive and diagnostic analysis, with the ability to interpret data and derive meaningful insights.
-
Experience working with structured or semi-structured datasets, including data cleaning, transformation, and validation.
-
Ability to break down business or analytical problems into logical, manageable steps.
-
Basic understanding of data visualization principles, with experience creating dashboards or reports using Excel, Power BI, Tableau, or similar tools.
-
Interest in automating repetitive data analysis and reporting tasks using SQL or Python.
-
Strong analytical thinking, attention to detail, and problem-solving skills.
-
Good written and verbal communication skills, with the ability to present findings clearly.
-
Willingness to learn new analytical tools, technologies, and best practices in data analysis.
Good to Have (Optional)
-
Familiarity with notebook-based workflows such as Jupyter Notebook or Google Colab.
-
Exposure to large datasets, distributed data processing concepts, or basic statistical and predictive analysis through academic projects.
-
Participation in hackathons, research projects, Kaggle competitions, or open-source data projects.
Responsibilities
-
Apply foundational knowledge of SQL, Python, and data analysis techniques to solve analytical problems and generate accurate insights.
-
Query, clean, transform, and validate data from multiple sources to create analysis-ready datasets.
-
Perform descriptive and diagnostic analysis to identify trends, patterns, anomalies, and actionable insights.
-
Support data validation and quality assurance by identifying inconsistencies, missing values, and logical issues in datasets.
-
Create dashboards, reports, and visualizations that effectively communicate analytical findings to stakeholders.
-
Collaborate with mentors and cross-functional teams to understand business requirements and translate them into analytical tasks.
-
Document analytical methodologies, assumptions, and findings to ensure clarity, reproducibility, and knowledge sharing.
-
Assist in automating recurring data extraction, validation, and reporting processes using SQL and Python.
-
Gain exposure to data pipelines, scalable data processing concepts, and modern analytics workflows.
-
Continuously develop technical and analytical skills by learning new tools, techniques, and industry best practices while contributing to team initiatives.