7:30 Data for Non-Profits Working Session/Discussion
8:15 Closing Comments
Have you ever told someone at a party you are a data scientist or data engineer and find it to be the biggest buzz kill? Do you keep hearing about big data, artificial intelligence, and data science but have no idea how this applies to you? If you answered yes to either of these questions then this presentation is for you!
Sadie will break down data, what is it, how people are using it and why you are a lot closer to working with data then you may think. This talk will be helpful for those who already work with data to be able to explain the impact of their work, along with those new to the field who want to gain a better understanding of the world of data.
About Sadie St. Lawrence:
Sadie is the founder and executive director of Women in Data, a non-profit organization focused on creating economic opportunity for women in data and increasing global data literacy. She is also the lead data scientist for VSP Global and an instructor for UC Davis. In her free time, Sadie enjoys playing the piano, doing yoga, spending time outdoors, and walking her dog Bentley. She believes we are able to solve our biggest problems when we work with diverse groups and add a bit of compassion into everything we do.
We will discuss what makes a data practice a WMD, how to spot them, and how to disarm them. This presentation will be based on the book Weapons of Math Destruction by Cathy O'Neil. There is no need to read the book before attending, but feel free to read along beforehand to better engage in group discussion and avoid any unwanted 'spoilers.'
-Avolyn is a data analyst for Nationwide Insurance. She is a current grad student at Villanova completing her Master of Science in Analytics. She is passionate about data and using it for good.
Brief Description: This presentation walks through a methodology for building a predictive regression model to predict home prices. The project incorporates data cleaning, feature engineering, feature filtering, and one-hot encoding to prepare your data for model building. We compare various algorithms and walk through the process of hyper-parameter tuning. Short bio:
Jackie Zuker has a master’s degree in Bioinformatics from Johns Hopkins University, where she learned to analyze genomes and other large datasets using statistical analysis and coding. She has recently shifted her focus to data science, building predictive models for work in the casino industry. In her free time, Jackie enjoys living in the mountains, raising her two young daughters, and geeking out with data.
This will be an in depth technical training to get you up and running with Jupyter Notebooks, and Python using popular data science packages.
We have a limited number of spots, so please only RSVP if you can attend all of the sessions.
• Getting Started in Python & Jupyter
• Data Structures
• Exploration & Viz
• Missing Values
• Outlier Detection
• Linear Regression
• Feature Engineering & Transformations
• Validating Model Performance
• Logistic Regression
• Naïve Byes
• Decision Tree
• Ensemble Models
• Scikit Learn
Whether you're familiar with blockchain or not, this talk will explore the potential blockchain has to revolutionize computing as we experience it today. Beyond cryptocurrency we will discuss the implications for smart contracts, cyber security, the internet of things, and software within this new environment. We will briefly cover the history of blockchain and then dive into how blockchain actually works and the potential use cases being explored and implemented across the various blockchain platforms today. We will also dive into the use cases for AI including the promise of decentralized autonomous organizations and how smart contracts are really just autonomous agents living inside blockchain networks. This talk is meant to bring even beginners up to speed but also go into technical detail for those who are familiar but want a greater depth of understanding around blockchain.
Avolyn is a data nerd and blockchain enthusiast. When she is not out evangelizing blockchain she serves as Manager of Partner Relations/Strategist for Women in Data, Sr. Analyst for Nationwide Insurance, and grad student working toward her Masters in Analytics at Villanova.
Join your fellow data nerds in predicting the diagnosis of breast cancer with the Kaggle Breast Cancer Wisconsin (Diagnostic) data set. All ages and skill levels welcome, we will be using Python/Jupyter Notebooks and R to work on these machine learning problems. Light refreshments and brain food will be provided thanks to the UC Davis Masters in Data Science Program.
This presentation will demonstrate how to use Apache Spark for Data Science practices, which will be applied to the Global Terrorism Database (GTDB). It will include the required data preparation techniques (feature selection, cleaning, and transformation) before proceeding to clustering, anomaly detection, and model evaluation. The entire demonstration will be presented in an Apache Zeppelin notebook, and it will include a brief introduction Apache Spark and Apache Zeppelin. Those in attendance can participate using Hortonwork’s HDP sandbox, which is a free single node environment of Apache Hadoop. https://www.kaggle.com/START-UMD/gtd
6:30 - 7:00 Networking
7:00 - 8:00 Presentation
Time to break out your prediction skills or join a team and learn new ones! In this session, we will work in teams to tackle the Kaggle Titanic Machine Learning Problem (https://www.kaggle.com/c/titanic). This will be a great opportunity to practice on a real data set, learn from others and work in a team. Networking and teams will form at 6:30 pm and we will begin working on the problem at 7:00 pm. Please bring your laptop to this session.
Summary: In 2012, the Harvard Business Review named Data Scientist the sexiest job of the 21st century. The article and subsequent studies have called out a shortage of data scientists that will increase over the next 20-30 years. In this talk, I will explore what exactly is meant by the term “data scientist” and describe some aspects of STEM training that can transform a person into a data scientist in the eyes of corporate employers. Using my own career transition from a mathematician to data scientist, I will cite examples of problems that data scientists study including energy usage and renewables integration, environmental monitoring, market intelligence, and hiring and retention practices. I will pay particular attention to the “secondary” skills of STEM workers that can help them succeed as data scientists in the corporate world.
Presenter Bio: Dr. Genetha Anne Gray is an analytics research scientist in the Data Center Group at Intel where she works on the design and develop AI algorithms. Previously she was part of the Talent Intelligence & Analytics organization where she analyzed talent supply chains, studied career progression, and modeled the changing representation of women and URMs in the workforce. Before joining Intel in 2014, Genetha spent 12 years as a member of the technical staff at Sandia National Labs in Livermore, CA. There, she worked on problems related to the electrical and mechanical engineering of systems, the storage of nuclear waste, groundwater remediation, cyber security, and energy including renewables integration and grid operations. She has a Ph.D. in Computational & Applied Mathematics from Rice University and specializes in analytics techniques for decision making under uncertainty including optimization, data fusion, model validation, and uncertainty quantification. She has co-authored more than 25 research publications and given more than 50 presentations. Genetha also co-authored a recent text book on environmental modeling.
Register here: https://www.meetup.com/Sacramento-Women-in-Data-Science/events/237809624/
Summary: Effective storytelling is critical towards ensuring that data visualizations connect with your audience. In this interactive presentation, we will discuss how to leverage data to create or enhance the story being told. Key concepts include the Duarte philosophy of storytelling, incorporation of visual best practices, avoidance of common visualization mistakes, and strategies to improve audience engagement.
An incredible amount of work goes into collecting data, storing it in a data warehouse, running it through the ETL process, validating it, aggregating it, querying and reporting on it, and ultimately analyzing it to discover business value. If this information cannot be effectively communicated, then all that effort is for naught. By employing a narrative structure to analytical findings, the data can better resonate with audiences and facilitate actionable decisions.
We will examine the history of data visualization, practical storytelling examples in healthcare, and current best standards across all industries. You will be armed with the knowledge and tools necessary to improve your visual storytelling skills to make deeper connections with your audience.
This presentation is for anyone who is looking to utilize storytelling both to deliver more effective presentations and to showcase data in a meaningful way.
Presenter Bio: Brent Johnson is a Data Analyst/Consultant with Cerner Corporation, embedded at Adventist Health West, a Cerner partner headquartered in Roseville. His focus is utilizing data to better understand Population Health and the Revenue Cycle across hospital systems. He tells stories with data and uses software and tools to facilitate data-driven decision making. Brent earned his BA in economics from the University of Miami (FL) and his MBA from UC Davis. Prior to working in healthcare, he worked in the vision, biomedical research, and immigration industries.
Register here: https://www.meetup.com/Sacramento-Women-in-Data-Science/events/237125650/
Summary: There are many theories around pricing and consumer behavior, and many different methodologies on how to appropriately set prices for any good or service. This presentation/tutorial will focus on leveraging transactional data to understand consumer purchasing behavior, and apply statistical methodologies to develop pricing strategies. The presentation & walk through will largely focus on Ordinary Least Squares (OLS), and we will discuss how different models can be leveraged to improve upon the OLS example.
Presenter: Ryan Huffman, MS
Bio: Ryan Huffman has over 12 years of experience as an economist, working in a variety of industries where he offers expertise in market dynamics, consumer behavior, supply chain & operations, predictive analytics and statistical modeling. He is currently a Digital Analytics Manager for VSP Global where he has developed and built the analytics organization for VSP’s new e-commerce platform, Eyeconic.com. Ryan has a B.A. in Economics from Sonoma State University and an M.S. in Economics from Texas A&M.
Register here: https://www.meetup.com/Sacramento-Women-in-Data-Science/events/236511835/
- Enjoy complimentary lunch
- Meet and learn from top tech experts
- Connect with the best in IT recruiting
This event is intended for those interested in technology and data science of all experience. Get connected with experts in tech and data science!
Summary: Maps are an incredibly useful way to visualize and explore information. In this presentation, we'll cover some of the basics of working with spatial data in data science applications, including key concepts for working with spatial data, basic cartographic principles, and some clear examples for getting started in R, Google Earth, and other open source geospatial software platforms.
Presenter: Keely Roth, PhD
Bio: Keely is a geographer and remote sensing scientist with over 10 years' experience working with imagery and geospatial data. The majority of her research career has been focused on evaluating sensor capabilities for mapping and monitoring natural ecosystems. Recently, she joined the Geospatial Research Team in the Data Science Division of the The Climate Corporation, a precision agriculture analytics company. She loves working with pixels and spatial data to make useful, informative maps.
See how Sacramento-based VSP Global is transforming their ability to leverage data and analytics to create new business value. From technology to processes to people … get a peek into how a $4B organization is navigating this journey with Alan Cordell, Senior Director Enterprise Analytics & Information Management and Kelly Freitas, Senior Director Digital Engagement & Customer Insights.
Let's have some fun! Join us for a Networking Mixer at Hot Italian Downtown Sacramento, and meet and mingle with Data Science and Technology peeps and those interested in the field. This is a great opportunity to learn about Data Science and Technology, as well as network with those who currently are in school and work in the field.
See you there!