Edward Bullen - Building a ChatBot with Python, NLTK and scikit HD

14.06.2017
Filmed at PyData 2017 Description Introducing the basics of Natural Language Processing using Python NLTK and Machine Learning packages to classify language in order to create a simple Q&A bot. Abstract Working code samples and a basic ChatBot framework (written in Python) will be provided and explained so that a simple Q&A bot that learns from previous experience and responds to questions with appropriate answers can be created. In this talk we will cover: Build a basic ChatBot Framework using core Python and a SQL database. Demonstrate and experiment with a Learning-by-Example bot using ranking functions in Python and SQL to get some basic chat functionality working. Introduce the Python NLTK to extract features from the chat sentences and words stored in the chatbot database. Work through a feature engineering example using NLTK and Sci-Kit and Numpy to show how we can classify sentences using Supervised Learning and estimate the accuracy of our classification model. Apply the sentence classification ML model to our chatbot engine to target responses more accurately. Prerequisites Attendees will need: + Anaconda for Python 3.5 or 3.6 + NLTK (Python Natural Language Toolkit - pip install nltk) + The Stanford Java CoreNLP Parser (https://stanfordnlp.github.io/CoreNLP/ or wget http://nlp.stanford.edu/software/stanford-corenlp-full-2016-10-31.zip and un-zip) + Java rel 8 Theoretically all of this could be installed on the day but it would just help to save time by preparing in advance. Most of what I am demonstrating will probably work against Python 2.7, but it hasn’t been tested with 2.7. www.pydata.org PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. We aim to be an accessible, community-driven conference, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.

Похожие видео

Показать еще