Position Information
Posting date |
12/18/2024 |
Closing date |
|
Open Until Filled |
Yes |
Position Number |
1129150 |
Position Title |
Artificial Intelligence Research Data Science Specialist |
Department this Position Reports to |
Research Data Services |
Hiring Range Minimum |
$108,700 |
Hiring Range Maximum |
$125,000 |
Union Type |
DCLWU |
SEIU Level |
Not an SEIU Position |
FLSA Status |
Exempt |
Employment Category |
Regular Full Time |
Scheduled Months per Year |
12 |
Scheduled Hours per Week |
40 |
Schedule |
|
Location of Position |
Hanover, NH |
Remote Work Eligibility? |
Hybrid |
Is this a term position? |
No |
If yes, length of term in months. |
NA |
Is this a grant funded position? |
No |
Position Purpose |
This position works as part of the Dartmouth Libraries Research Data Services team to support research, curricular, and applied artificial intelligence work on campus. The person in this role will combine data science skills with expertise in information curation and knowledge management to support a variety of generative artificial intelligence applications, such as semantic search, retrieval-augmented generation, and information/data retrieval application development. Working alongside campus partners engaged in data science and generative artificial intelligence work, this role will focus on database creation, data ingestion, information preprocessing and embedding, vector database management, and system optimization.
This position is eligible for a hybrid work location. |
Description |
|
Required Qualifications - Education and Yrs Exp |
Bachelor's degree plus 3-5 years of experience, or an equivalent combination of education and experience |
Required Qualifications - Skills, Knowledge and Abilities |
- BA in a quantitative or related field + 3-5 years of experience; or MA in a quantitative or related field + 1-3 years; or PhD in a quantitative or related field; or MLIS + 1-3 years
- 1-3 years of relevant education or work experience in research or applied AI environments
- Demonstrated knowledge of programming/scripting languages and analysis applications (e.g., R, Python, SAS, SPSS)
- Experience using GenAI, deep learning frameworks, and natural language processing (NLP) in projects; or experience with database design and development
- Experience with preparing data for analysis, visualization, and other procedures
- Demonstrated ability to work independently and as a team member to solve problems
- Excellent oral and written communication skills
- Strong interpersonal and organizational skills
- Excellent analytical skills
- Willingness to learn new programming languages, statistical analysis tools, or other relevant tools as needed
|
Preferred Qualifications |
- Experience with data tools and services, including HPC, in a research library or academic/research setting
- Demonstrated ability to initiate, plan, coordinate, implement, and assess complex programs, projects, and services
- Professional experience working with research data and/or in an academic library
- Demonstrated knowledge of data management, curation, and preservation principles and practices
- Demonstrated knowledge of open data, data repositories, and the data life cycle
|
Department Contact for Recruitment Inquiries |
Lora Leligdon, Head of Research Data Services |
Department Contact Phone Number |
603-646-3845 |
Department Contact for Cover Letter and Title |
Lora Leligdon, Head of Research Data Services |
Department Contact's Phone Number |
603-646-3845 |
Equal Opportunity Employer |
Dartmouth College is an equal opportunity/affirmative action employer with a strong commitment to diversity and inclusion. We prohibit discrimination on the basis of race, color, religion, sex, age, national origin, sexual orientation, gender identity or expression, disability, veteran status, marital status, or any other legally protected status. Applications by members of all underrepresented groups are encouraged. |
Background Check |
Employment in this position is contingent upon consent to and successful completion of a pre-employment background check, which may include a criminal background check, reference checks, verification of work history, conduct review, and verification of any required academic credentials, licenses, and/or certifications, with results acceptable to Dartmouth College. A criminal conviction will not automatically disqualify an applicant from employment. Background check information will be used in a confidential, non-discriminatory manner consistent with state and federal law. |
Is driving a vehicle (e.g. Dartmouth vehicle or off road vehicle, rental car, personal car) an essential function of this job? |
Not an essential function |
Special Instructions to Applicants |
Dartmouth College has a Tobacco-Free Policy. Smoking and the use of tobacco-based products (including smokeless tobacco) are prohibited in all facilities, grounds, vehicles or other areas owned, operated or occupied by Dartmouth College with no exceptions. For details, please see our policy.
https://policies.dartmouth.edu/policy/tobacco-free-policy
|
Additional Instructions |
|
Quick Link |
https://searchjobs.dartmouth.edu/postings/77026 |
Key Accountabilities
Description |
Works with researchers, staff, and students to refine the collection and curation of corpus documents so that datasets are suitable for artificial intelligence and related computational techniques. Designs database architectures for storing documents and the vector databases that will hold document embeddings. Ensures database scalability, reliability, and performance; monitors system performance and optimizes queries for quick retrieval times and high relevance of retrieved documents. Regularly updates the database with new entries and re-indexes as needed. |
Percentage Of Time |
30% |
Description |
Assists researchers, staff, and students in the development and application of document preprocessing pipelines to clean and prepare text data for embedding. Automates transcription processing where necessary, including language detection, segmentation, and annotation. Collaborates with librarians to properly handle metadata and maintain data integrity. |
Percentage Of Time |
20% |
Description |
Utilizes machine learning models to generate embeddings from preprocessed text data. Indexes embeddings efficiently within the vector database for fast retrieval. Analyzes retrieval accuracy and optimizes the system by applying query transformations and result reranking techniques. |
Percentage Of Time |
20% |
Description |
Provides instruction, outreach, and consultations on advanced computing concepts for faculty, students, and staff to expand computational research skills (including data discovery, curation, management, storage, analysis, visualization, and preservation) as needed for curricular or research projects. |
Percentage Of Time |
10% |
Description |
Collaborates with Library Research Data colleagues and Information Technology & Consulting colleagues to integrate databases effectively with campus AI infrastructure and large language models, and to fine-tune the models based on the data structure and requirements. |
Percentage Of Time |
10% |
Description |
Engages in focused professional development activities and serves on applicable Dartmouth committees and task forces, with an emphasis on data science techniques, generative artificial intelligence, and ethical applications of novel technologies. Recommends and facilitates improvements to existing programs and services, and participates in internal training and professional development for Dartmouth Library and related staff. |
Percentage Of Time |
10% |
-- |
Demonstrates a commitment to diversity, inclusion, and cultural awareness through actions, interactions, and communications with others. |
-- |
Performs other duties as assigned. |
|