This project uses an existing corpus of newspaper articles to derive models that identify (1) whether a MIC occurred, (2) what type of militarized interaction was described, (3) whether fatalities occurred, and (4) the range of fatalities, if any. The newspaper articles have been retrieved based on boolean search terms associated with militarized conflict. However, more than 95% of the articles are false positives for identifying MICs.
Total project length: 175 hours
The project requires the ability to code in Python and knowledge of machine learning and natural language processing.
This project has a moderate level of difficulty.
Please DO NOT contact mentors directly by email. Instead, please email human-ai@cern.ch with Project Title and include your CV and test results. The mentors will then get in touch with you.