Research Assistant (Department of Health Policy & Management)
We are seeking a motivated and detail-oriented Research Assistant to assist with a research project focused on data extraction. The successful candidate will be responsible for tasks involving HTML parsing and extracting structured data from TXT, html and PDF files using Python. This position offers the opportunity to apply technical skills in real-world research applications. The ultimate goal of the project is to create a high-quality dataset intended for public consumption. Extensive documentation and collaboration with other colleagues conducting quality control checks will be integral parts of the process.The Research Assistant oversees data collection, data organization, and/or data management or similar functions/tasks for research study(ies) in support of a PI or a research team.Specific Duties & ResponsibilitiesRun routine and ad hoc reports.Use standard tools and computer programs to review data.Assist with data cleaning measures to ensure accuracy of data and preparation of tables.Lead basic activities such as data collection and data entry.May lead specific tasks and develop processes to ensure study activities occur effectively and efficiently.May conduct literature searches to support faculty in research efforts.May design and format papers/publications.May assist PIs in writing summaries of papers for release as policy briefs or other channels.Other duties as assigned.In addition to the duties described above Parse and extract data from HTML and TXT files to generate structured datasets. Develop and implement Python scripts to automate data extraction. Refine and improve existing code to enhance efficiency and functionality. Clean and preprocess extracted data for further analysis. Write unit tests to ensure quality. Document workflows, scripts, and processes comprehensively to ensure reproducibility and transparency. Collaborate with other team members to ensure data quality through regular checks and reviews. Contribute to project milestones and adhere to deadlines.Minimum QualificationsBachelor's Degree in a related field. Additional education may substitute for required experience and additional related experience may substitute for required education beyond a high school diploma/graduation equivalent, to the extent permitted by the JHU equivalency formula. Preferred Qualifications Proficiency in Python programming, including libraries such as BeautifulSoup for HTML processing, unit testing frameworks and object oriented design. Familiarity with data cleaning, preprocessing, and handling diverse file formats. Strong analytical skills with attention to detail. Ability to work independently and efficiently manage time.Technical Skills & Expected Level Of ProficiencyAnalytical Skills - Awareness Attention to Detail - Awareness Data Management and Analysis - Awareness Formatting and Layout Proficiency - Awareness Information Gathering - Awareness Oral and Written Communications - Awareness Organizational Skills - Awareness The core technical skills listed are most essential; additional technical skills may be required based on specific division or department needs.Classified Title: Research AssistantRole/Level/Range: ACRO40/E/03/CDStarting Salary Range: $17.20 - $30.30 HRLY ($25.17 targeted; Commensurate w/exp.)Employee group: Casual / On CallSchedule: Hours Vary: Up to 27 hours per weekFLSA Status: Non-ExemptLocation: RemoteDepartment name: 60004112-Health Policy & Management -IndependentPersonnel area: School of Public Health