Solution Architect
Philadelphia, PA
elizabeth.a.sheffield@gmail.com
Research Focus
Natural Language Processing
Sentiment Analysis
Language Generation
Data Propogation
Languages
English
Perl
Python
Current PhD student advised by Dr. Jake Ryland Williams
Graduated with Honors
I am a solution architect currently pursuing a part-time PhD in Information Science at Drexel University. Initially interested in the combination of computer science and linguistics during undergrad, I am particularly interested in how NLP can simplify data analysis/comprehension. I have a secondary interest in how machines can better comprehend human language idiosyncrasies and how we can leverage data to address social issues.
On a personal note, I enjoy hiking, running, and playing open world video games.
Entertainment Metadata Architect supporting apps on Sky, NBC, and Comcast platforms. Focused on Sports, Music, and VOD metadata. Supporting inbound and outbound metadata flows, architecture work revolves around a centralized master data management (MDM) platform for entertainment metadata. The metadata empowers complex content discovery and personalization use cases.
Career Progression from: Java Developer, Technical Lead, Delivery Lead, Senior Systems Analyst, to Solution Architect in the Provider Data Domain. Designed large scale solutions involving multiple custom and vendor applications providing users the tools to analyze and manage provider data. Working in a quasi-researcher function, evaluated vendors, produced options, and worked with the scrum teams to deliver the final solutions.
Supported a course focused on efficient storage, organization, and retrieval of information
Utilizing a repository of 8+ year old Twitter accounts, this research includes developing a taxonomy for episodes of social media use, algorithmically segmenting timelines into episodes of use, evaluating the algorithmic segmentation. Finally features will be identified to train a model on predicting if the author of posts within an episode of social media use is the same author of a previous episode of use - this makes it a low context author attribution task.
Customer service interactions are currently being steered towards chat bot interactions, but chat bots are not skilled at detecting the use of sarcasm or irony in responses. Looking at current methods of sarcasm detection on stand-alone tweets and customer reviews, then applying methodologies conversations (expanding datasets to reply-tos and customer service logs).
Theory: Valid data updates within an integrated domain should behave differently, possibly entering the network through multiple nodes, than bad data updates. i.e. Fake events or facts should propogate through news sites in a different manner than a real event
When identifying the poet for a given stanza of text, are morphological/phonological/style statistics more relevant than word choice?
Hiking Trail Selection can be a convoluted process for hikers unfamiliar with available trails. Individuals may seek to find trails of specific length, location, or other feature, but while this information is available in the descriptions of trails on local websites, the information is neither linked nor searchable. The Hiking Trail Ontology seeks to address this gap.
A database of NFL Running Back game performance metrics and social media activity metrics using the SportsDataIO API and Twitter API. Preliminary structure to support questions around the impact of social media use/trends on player performance.