What is a property of a good color model for ordinal data?
A. Uses a rainbow-like color map for distinction of categories
B. Uses a rainbow-like color map for ease of display and printing
C. Uses perceptually ordinal colors with just-noticeable increments
D. Uses perceptually ordinal colors with linear, perceptual increments
In which step in the visualization lifecycle would you determine how the raw data is stored?
A. Visualization Planning
B. Data Preparation
C. Visualization Building
D. Discovery
What are two visualization tools used for trivariate data?
A. Scatter plot matrix
B. Hexbin plot and heatmap
C. Scatter plot matrix and density plot
D. Scatter plot matrix and heatmap
You are analyzing written transcripts of focus groups conducted on product X. You approach is to use TFIDF for your analysis.
What combination of TF-IDF scores should you examine to ensure you only report on the most important terms?
A. High TF score and high DF score
B. High TF score and high IDF score
C. High TF score and low IDF score
D. Low TF score and low DF score
What do lemmatization and stemming have in common?
A. Use WordNet
B. Remove common words in a natural language
C. Reduce the high dimensionality in text
D. Use a set of heuristics
Consider dataset that resides in HDFS. Which tool natively provides the capability to run a Random Forests model against this data?
A. Mahout
B. Pig
C. Hive
D. HBase
What is a characteristic of the trigram language model?
A. Based on the second-order Markov process
B. Equivalent to trigram hidden Markov models
C. Uses smoothing to reduce the high dimensionality in text
D. Can be used for part-of -speech tagging
What best describes tokenization?
A. Adding lexical relations to the raw text
B. Converting text into the list of terms
C. Converting text into a list of unique terms
D. Reducing variant forms of tokens to their base forms
Consider the two sentences below. I mailed my credit card application to the bank We walked along the river bank until we came to a waterwheel
What type of NLP ambiguity might occur when interpreting the word "bank"?
A. Discourse
B. Syntactic
C. Semantic
D. Acoustic
What is a characteristic of stemming?
A. Reduces words of variant forms to their base forms based on a set of heuristics
B. Can be performed by calling the stemming!) function on a lemma in NLTK
C. Can be performed by calling the stemming() function on a synset in NLTK
D. Reduces words of variant forms to their base forms based on a dictionary