Phase Zero of Human Name Detection for Border Observer

Objective Find a person's full name within a press release or news story. Reason Currently - on average, we are seeing 200 news stories per weekday . Clustering the same or similar news stories is helpful for readers, but too often the headlines don't match very similar stories. As such, with slightly dissimilar headlines, the task of clustering becomes tedious, but there is a solution. It is well known (by internet search engines) that some internal "markers" (like a person's name) will help with the clustering of similar webpages, press releases or news stories. That is the focus of work ALMOST done. BOTTOM LINE As we have stated more than once, MOST news stories start with a press release. So, if we start by detecting a person's name in the press release, we can then catch that name early and thereby detect that name in news stories as they go online. So, we start here — clustering news stories based on the "full name" of the person in the news...