Software/Algorithms

We have developed a set of text-mining algorithms to extract education and occupation from electronic health records. These are important variables describing socioeconomic status (SES) which can be incorporated into research studies utilizing electronic health record data. The development and evaluation of the algorithm is described in PMC5147499 and the exclusion, jobs, and prefix lists can be found here. Detailed use of the package can be found on the github site.