iSchool Capstone

2016

Project Logo

Analyzing the Legal Field of Security and Privacy

An ever increasing amount of data collection on social media contributes to additional security implications which are not outlined in the End User License Agreements that we see today. We believe that this data can be used in many ways to violate user anonymity and create digital profiles of users based on various data processing methods. We can then cross-reference these digital profiles to other social media platforms to identify previously anonymous users. To do this, we hope to use numerated social media APIs which contribute to the release of Personally Identifiable Information. From this, We hope to educate users on End User License Agreements while informing them on applications of data usage by performing analysis on Reddit user data and other social media platforms.
Project Logo

Automated Data Analysis Framework

The Financial Inclusion Insights (FII) team at the Gates Foundation is responsible for performing data quality checks and analysis on survey data collected from eight countries in Africa and Asia. The survey data is related to usage of mobile money and digital financial services (DFS) in these countries. The current process is manual and tedious and involves multiple people working simultaneously to perform data analysis. Our solution solves this issue by automating the manual process and generates customizable data analysis reports. The result is a robust, modularized and highly customizable framework which will make it easy for FII to explore the “what”, “how” and “why” of demand­ side trends in mobile money and other digital financial services (DFS). This helps FII model, construct, and perform better research to understand the impact of DFS and promotes adoption of these services as they are revised using the research insights.
Project Logo

Curato

Although the internet provides a wide breadth of information in an extremely easy manner, a common issue people face -- regardless of the type of information -- is an overabundance of information. Services like Yelp, TripAdvisor, and GoGoBot provide lots of information, but do not always provide an easy way to whittle restaurant/activity choices to help a user make a decision. Moreover, the results of a search may not always be relevant to that user’s personal interests, due to how general the results are. Curato attempts to provide a single, convenient application to help users find businesses and points of interest relevant to their interests by taking advantage of simple machine learning algorithms.
Project Logo

Echo

Echo is an interactive sound visualization tool, designed to help students learn about sound design and audio engineering. Currently, students do not have the resources that they need in order to learn about audio engineering and acoustic environments. Most modern sound visualization tools are proprietary and require industry knowledge to discern meaning from them. Echo aims to help teachers keep their students interested and engaged in learning about sound design concepts by implementing a unique approach to sound visualization. We discovered that virtual reality is the ultimate medium to immerse someone in an acoustic environment, and will promote the highest level of understanding in all of our users. Our goal is to lower the barrier of entry into the professional sound design and audio engineering industry. This will effectively enrich the knowledge pool in the industry, therefore leading to greater insight and discovery for acoustic designers on all levels.
Project Logo

EnVizion - Empowering indigenous minds through technology

We are working with the School of Environment and Forest Science (SEFS) at University of Washington towards the mission of making environmental data accessible to Native American people. The researchers at SEFS have collected large data sets related to land cover, hydrology and precipitation to learn carbon emissions and absorption in that area. The challenge that they currently face is to make this scientific data available to the Native Americans in a format that they can relate with and interact. In order to facilitate this, we have integrated the existing environmental and ecological data and created data visualizations as a POC to ascertain that this process can be automated with the real time data. We have also prototyped user-friendly dashboard to present this information to the indigenous people in an intuitive way so that they can evaluate the health of their land and make better decisions about their environment.
Project Logo

Everqry: Adobe Stock User Intention Analysis

In the realm of online content retrieval, understanding user intentions based on search behaviors is essential to connecting customers to the content they seek. Adobe Systems recently acquired Fotolia, a digital asset supplier, to more efficiently provide their customers with quality images and videos from inside their ecosystem. Working with Adobe Stock, we accessed all of their query and content engagement data collected to date. In this formative analysis, we applied natural language processing to search terms. Results were paired with multiple metrics of user interaction associated with Adobe’s content. These data were grouped, or clustered, to reveal hidden layers of similarity across queries. These clusters, representing similar user intentions, help identify Adobe’s underperforming segments of customer interests and behaviors. Once identified, treatments such as user interface modifications, search algorithm changes, or query refinement suggestions can be targeted to queries in the same cluster. This will enhance the Adobe Stock user experience¬ increasing customer retention, satisfaction, and spending.
Project Logo

intelligentIR

Starbucks’ information security team is continually seeking to understand which of its security events to prioritize for response. Although the organization utilizes a security information and event management tool for detecting anomalous activity, the number of alerts being generated by the tool are overwhelming and difficult to manage. This is an issue that security teams at many large organizations face; how do you sift through the noise and find the events that are most likely indicative of a security threat or breach? IntelligentIR helps answer this question through the use of machine learning techniques. Using unsupervised learning to label raw security data along with supervised learning to build decision models, intellingentIR identifies and prioritizes new security alerts in order to make incident response more manageable.
Project Logo

Mission Admission

Every admission season, schools struggle with attracting the best talent and achieve maximum conversion rate at the same time. They want to avoid both under and over filling of the classrooms. Therefore, there is a need to strategically balance the quality and quantity of incoming student population. We extracted and analyzed important features of the data provided by our sponsor Ravenna Solutions. Our predictive analysis shows how significant select factors are in determining a future admit offer or an admit conversion. Thus, our project provides key insights to admission directors of K-12 schools regarding factors influential in making admission decisions. We help admission directors in making crucial admission decisions backed by application data of student aspirants. This makes the directors rely less heavily of their instinct or “gut feeling” and instead make data driven decisions. Hence, the efficiency in decision making results in better enrollments for the schools and thereby better school admissions for the students alike.
Project Logo

PainlessVR

PainlessVR is a research tool that allows researchers from medical, psychological and any other fields to test hypotheses about Virtual Reality without any development at all. The goal of the product is to improve future virtual reality pain management applications by removing the barriers to researching virtual reality. PainlessVR allows researchers to modify, and by extension study, variables regarding color, sound, context and cognitive load. Improving VR pain management would mean improving care for those who are unable to take traditional pain medication such as burn victims or recovering opioid addicts.
Project Logo

Patent Patterns

Patents are arguably one of the most important means of rewarding innovation and creativity. According to the U.S. Commerce Department, in 2010, IP-intensive industries accounted for 34.8 percent ($5.06 trillion) of U.S. gross domestic product. Despite this significant contribution, the process of acquiring and maintaining a patent remains fraught with complexities. Through our research project we intend to shed light on some of these complexities and provide data driven insights into this process. By scraping, cleaning, and analyzing 10 years worth of publicly available utility patent data, we have attempted to examine and visualize some interesting and pressing topics like prevalence of any gender bias, trends around industries/organizations that produce patents, and countries that spearhead innovation across the world, among others. We believe the data and insights produced by this project can guide future research and improvement efforts in this field thereby benefiting the patent industry, in a broader sense.