The goal of this exercise is to extract keywords from emails and use then to deduce the expertise of a person and track the changes for a period of time.
How can you help?
>> java -jar ExpertFinding.jar
The *.mdl files are java serialized objects which stored some of the information from your emails (details given below) . The keywords file includes a list of keywords extracted from your emails for your inspection.
What information is gathered and why?
The following information is gathered from each of your email
username :to identify the account holder
folder :to filter emails according to folders
subject :used as short textual representations of the emails while generating reports
sender :to keep track of conversations between people
receiver :same as above
date :temporal analysis
keywords :each email is analyzed and keywords extracted, to be used for concept based analysis of expertise.
Also the source code is also available for inspection on the websvn
What is done with the information?
An example of an expertise graph against time for Pradeep during the past nine months is shown below.