课程: Complete Guide to NLP with R
今天就学习课程吧!
今天就开通帐号,24,700 门业界名师课程任您挑!
Examining sources
- [Instructor] As we've discussed, sources are a way to bring in documents into Corpora and there are several different types of sources. Let's take a look at them. In line one, I've brought in the text mining library and I'm also going to bring in the read text library 'cause I'll need it here in a minute. Before we get too far, you can always pull up a list of sources available to you with the text mining get Sources command as shown in line five. When I run that command, you can see that the console has produced a list of different sources dataframe source, directory source, URI source, vectorSource, XMLSource, and zip source. We'll cover each of those. Let's start with dataframe source. In line 10, I create a dataframe, read text, reads a bunch of files and produces a dataframe with those files in it. We'll run that and you'll see that I now have a dataframe, which is a dataframe of all of the poetry files. Let's take…
随堂练习,边学边练
下载课堂讲义。学练结合,紧跟进度,轻松巩固知识。