Question 1:
What is the primary purpose of the "pivot_table" function in Pandas when aggregating data?
A.
Aggregate and summarize data
B.
Reshape data
C.
Sort data
D.
Group data
Answer: _________
Question 2:
In Python, which library is commonly used for geospatial data analysis and mapping, especially in geographic information systems (GIS)?
A.
Plotly
B.
Seaborn
C.
Pandas
D.
GeoPandas
Answer: _________
Question 3:
Which Python library is often used for anomaly detection and outlier analysis in time series data?
A.
Pandas
B.
Seaborn
C.
Statsmodels
D.
PyOD
Answer: _________
Question 4:
What is the primary purpose of the "unstack" method in Pandas when working with hierarchical index data?
A.
Pivot a level of the index
B.
Grouping and aggregating data
C.
Sorting data
D.
Filtering data
Answer: _________
Question 5:
In Python, which library is commonly used for working with data stored in the Apache Parquet file format?
A.
Matplotlib
B.
Seaborn
C.
Pandas
D.
pyarrow
Answer: _________
Question 6:
In Pandas, which method is used to calculate the variance of a numerical column in a DataFrame?
A.
mean()
B.
sum()
C.
var()
D.
median()
Answer: _________
Question 7:
Which Python library is commonly used for creating interactive 3D visualizations and plots?
A.
Seaborn
B.
Plotly 3D
C.
Matplotlib
D.
None of the above
Answer: _________
Question 8:
What is the primary purpose of the "explode" function in Pandas when working with nested data in a DataFrame?
A.
Filtering data
B.
Sorting data
C.
Explode list-like columns
D.
Grouping and aggregating data
Answer: _________
Question 9:
In Python, which library provides tools for working with time series data, including time-based indexing and resampling?
A.
Statsmodels
B.
Numpy
C.
Pandas
D.
Pandas TimeSeries
Answer: _________
Question 10:
Which Python library is commonly used for distributed data processing and parallel computing, especially for big data analysis?
A.
Pandas
B.
Seaborn
C.
Dask
D.
Matplotlib
Answer: _________
Question 11:
In Pandas, which method is used to calculate the covariance between two numerical columns in a DataFrame?
A.
sum()
B.
cov()
C.
median()
D.
None of the above
Answer: _________
Question 12:
What is the primary purpose of the "agg" method in Pandas when applying multiple aggregation functions to different columns?
A.
Grouping data
B.
Filtering data
C.
Aggregating data
D.
Sorting data
Answer: _________
Question 13:
In Python, which library is commonly used for working with time series data and performing forecasting tasks with a focus on simplicity?
A.
Pandas
B.
Numpy
C.
Prophet
D.
statsmodels
Answer: _________
Question 14:
Which Python library is often used for working with large-scale, multidimensional arrays and data manipulation in scientific computing?
A.
Dask
B.
Seaborn
C.
xarray
D.
Pandas
Answer: _________
Question 15:
What is the primary purpose of the "clip" method in Pandas when working with numerical data?
A.
Grouping and aggregating data
B.
Sorting data
C.
Filtering data
D.
Clip values outside a specified range
Answer: _________
Question 16:
In Python, which library is commonly used for visualizing decision trees and random forests in machine learning?
A.
dtreeviz
B.
Pandas
C.
Seaborn
D.
Matplotlib
Answer: _________
Question 17:
What is the primary purpose of the "pipe" method in Pandas when chaining multiple data transformation steps?
A.
Filtering data
B.
Sorting data
C.
Grouping and aggregating data
D.
Apply a sequence of functions
Answer: _________
Question 18:
In Python, which library is often used for working with graph data structures and performing graph algorithms?
A.
NetworkX
B.
Seaborn
C.
Pandas
D.
igraph
Answer: _________
Question 19:
Which Python library is commonly used for creating interactive and web-based dashboards for data analysis and visualization?
A.
Pandas
B.
Seaborn
C.
Panel
D.
Dash
Answer: _________
Question 20:
What is the primary purpose of the "rolling" method in Pandas when working with time series data?
A.
Sorting data
B.
Calculate rolling statistics
C.
Filtering data
D.
None of the above
Answer: _________
Question 21:
Which of the following makes use of pandas and returns data in a series or dataFrame?
A.
pandaSDMX
B.
freedapi
C.
OutPy
D.
none of the mentioned
Answer: _________
Question 22:
Which of the following is used for machine learning in python?
A.
scikit-learn
B.
seaborn-learn
C.
stats-learn
D.
none of the mentioned
Answer: _________
Question 23:
Which of the following indexing capabilities is used as a concise means of selecting data from a pandas object?
A.
In
B.
ix
C.
ipy
D.
none of the mentioned
Answer: _________
Question 24:
Point out the correct statement.
A.
Pandas consist of set of labeled array data structures
B.
Pandas consist of an integrated group by engine for aggregating and transforming data sets
C.
Pandas consist of moving window statistics
D.
All of the mentioned
E.
Timedeltas are differences in times, expressed in difference units
F.
You can construct a Timedelta scalar through various argument
G.
DateOffsets cannot be used in construction
H.
All of the mentioned
I.
Pandas represents timestamps in microsecond resolution
J.
Pandas is 100% thread safe
K.
For Series and DataFrame objects, var normalizes by N-1 to produce unbiased estimates
L.
All of the mentioned
M.
If data is a list, if index is passed the values in data corresponding to the labels in the index will be pulled out
N.
NaN is the standard missing data marker used in pandas
O.
Series acts very similarly to a array
P.
None of the mentioned
Q.
Statsmodels provides powerful statistics, econometrics, analysis and modeling functionality that is out of panda's scope
R.
Vintage leverages pandas objects as the underlying data container for computation
S.
Bokeh is a Python interactive visualization library for small datasets
T.
All of the mentioned
U.
All of the standard pandas data structures have a to_sparse method
V.
Any sparse object can be converted back to the standard dense form by calling to_dense
W.
The sparse objects exist for memory efficiency reasons
X.
All of the mentioned
Answer: _________
Question 25:
Point out the wrong statement.
A.
A DataFrame is like a fixed-size dict in that you can get and set values by index label
B.
Series can be be passed into most NumPy methods expecting an ndarray
C.
A key difference between Series and ndarray is that operations between Series automatically align the data based on label
D.
None of the mentioned
E.
to_array. append can accept scalar values or any 2-dimensional sequence
F.
Two kinds of SparseIndex are implemented
G.
The integer format keeps an arrays of all of the locations where the data are not equal to the fill value
H.
None of the mentioned
I.
lxml is very fast
J.
lxml requires Cython to install correctly
K.
lxml does not make any guarantees about the results of it's parse
L.
none of the mentioned
M.
Series is 1D labeled homogeneously-typed array
N.
DataFrame is general 2D labeled, size-mutable tabular structure with potentially heterogeneously-typed columns
O.
Panel is generally 2D labeled, also size-mutable array
P.
None of the mentioned
Q.
min, max, idxmin, idxmax operations are supported on Series
R.
You cannot pass a timedelta to get a particular value
S.
Division by the numpy scalar is true division
T.
None of the mentioned
U.
qgrid is an interactive grid for sorting and filtering DataFrames
V.
Pandas DataFrames implement _repr_html_ methods which are utilized by IPython Notebook
W.
Spyder is a cross-platform Qt-based open-source R IDE
X.
None of the mentioned
Answer: _________
Question 26:
. . . . . . . . plots are used to visually assess the uncertainty of a statistic.
A.
Lag
B.
RadViz
C.
Bootstrap
D.
None of the mentioned
Answer: _________
Question 27:
Andrews curves allow one to plot multivariate data.
A.
True
B.
False
Answer: _________
Question 28:
Which of the following is not an indexed object?
A.
SparseSeries
B.
SparseDataFrame
C.
SparsePanel
D.
None of the mentioned
Answer: _________
Question 29:
Which of the following is implemented on DataFrame to compute the correlation between like-labeled Series contained in different DataFrame objects?
A.
corrwith
B.
corwith
C.
corwit
D.
none of the mentioned
Answer: _________
Question 30:
Which of the following plots are often used for checking randomness in time series?
A.
Autocausation
B.
Autorank
C.
Autocorrelation
D.
None of the mentioned
Answer: _________
Question 31:
What is the primary purpose of the "merge" function in Pandas?
A.
Aggregate data
B.
Sort data
C.
Combine DataFrames
D.
Group data
Answer: _________
Question 32:
In Python, which library provides tools for working with complex data structures and data manipulation, often used for data analysis in the financial sector?
A.
Seaborn
B.
Dask
C.
Matplotlib
D.
None of the above
Answer: _________
Question 33:
Which Python library is commonly used for creating interactive data dashboards and reports for data analysis and visualization?
A.
Panel
B.
Pandas
C.
Seaborn
D.
Bokeh
Answer: _________
Question 34:
What is the primary purpose of the "stack" method in Pandas?
A.
Group data
B.
Sort data
C.
Aggregate data
D.
Reshape data
Answer: _________
Question 35:
In Python, which library provides tools for working with graph data structures and performing graph analysis tasks?
A.
Pandas
B.
Seaborn
C.
NetworkX
D.
Statsmodels
Answer: _________
Question 36:
In Pandas, which method is used to calculate the mode of a numerical column in a DataFrame?
A.
sum()
B.
mode()
C.
median()
D.
None of the above
Answer: _________
Question 37:
Which Python library is commonly used for data preprocessing tasks, such as feature scaling and transformation in machine learning?
A.
Matplotlib
B.
Numpy
C.
Scikit-learn
D.
Pandas
Answer: _________
Question 38:
What is the primary purpose of the "groupby" method in Pandas when working with categorical data?
A.
Sorting data
B.
Filtering data
C.
Reshaping data
D.
Grouping and aggregating data
Answer: _________
Question 39:
In Python, which library is commonly used for web scraping and extracting data from websites?
A.
BeautifulSoup
B.
Pandas
C.
Seaborn
D.
Matplotlib
Answer: _________
Question 40:
What Python function is used to create a new column in a Pandas DataFrame based on the values of existing columns?
A.
pd.read_csv()
B.
np.array()
C.
plt.plot()
D.
df['new_column'] = ...
Answer: _________
Question 41:
In Pandas, what method is used to calculate the mean, median, and other summary statistics for numerical columns in a DataFrame?
A.
sort_values()
B.
melt()
C.
describe()
D.
pivot()
Answer: _________
Question 42:
Which Python library is commonly used for working with geospatial data and creating maps and geographic visualizations?
A.
Pandas
B.
Seaborn
C.
Plotly
D.
Geopandas
Answer: _________
Question 43:
What is the primary purpose of the "applymap" method in Pandas?
A.
Apply a function element-wise
B.
Sort a DataFrame
C.
Reshape data
D.
Group data
Answer: _________
Question 44:
In Python, which library provides tools for working with large and multidimensional arrays and matrices and is often used in scientific computing?
A.
Seaborn
B.
Numpy
C.
Pandas
D.
SciPy
Answer: _________
Question 45:
Which Python library is commonly used for working with network data and performing network analysis tasks?
A.
Matplotlib
B.
Pandas
C.
NetworkX
D.
Scikit-learn
Answer: _________
Question 46:
In Pandas, which method is used to concatenate two or more DataFrames vertically (along rows)?
A.
stack()
B.
concat()
C.
unstack()
D.
None of the above
Answer: _________
Question 47:
What is the primary purpose of the "pivot_table" function in Pandas?
A.
Group data
B.
Sort data
C.
Reshape data
D.
Aggregate data
Answer: _________
Question 48:
In Python, which library is commonly used for natural language processing (NLP) tasks such as text classification and sentiment analysis?
A.
Pandas
B.
Seaborn
C.
Statsmodels
D.
spaCy
Answer: _________
Question 49:
Which Python library is often used for deep learning tasks, including neural network construction and training?
A.
TensorFlow
B.
Pandas
C.
Numpy
D.
Seaborn
Answer: _________
Question 50:
In Pandas, which method is used to fill missing values in a DataFrame with specified values or a filling method?
A.
melt()
B.
drop_duplicates()
C.
dropna()
D.
fillna()
Answer: _________
Question 51:
The plot method on Series and DataFrame is just a simple wrapper around . . . . . . . .
A.
gplt.plot()
B.
plt.plot()
C.
plt.plotgraph()
D.
none of the mentioned
Answer: _________
Question 52:
If data is an ndarray, index must be the same length as data.
A.
True
B.
False
Answer: _________
Question 53:
Which of the following thing can be data in Pandas?
A.
a python dict
B.
an ndarray
C.
a scalar value
D.
all of the mentioned
Answer: _________
Question 54:
Which of the following is used for testing for membership in the list of column names?
A.
in
B.
out
C.
elseif
D.
none of the mentioned
Answer: _________
Question 55:
Which of the following operation works with the same syntax as the analogous dict operations?
A.
Getting columns
B.
Setting columns
C.
Deleting columns
D.
All of the mentioned
Answer: _________
Question 56:
Which of the following is used to generate an index with time delta?
A.
TimeIndex
B.
TimedeltaIndex
C.
LeadIndex
D.
None of the mentioned
Answer: _________
Question 57:
Which of the following library is similar to Pandas?
A.
NumPy
B.
RPy
C.
OutPy
D.
None of the mentioned
Answer: _________
Question 58:
Which of the following input can be accepted by DataFrame?
A.
Structured ndarray
B.
Series
C.
DataFrame
D.
All of the mentioned
Answer: _________
Question 59:
Numeric reduction operation for timedelta64[ns] will return . . . . . . . . objects.
A.
Timeseries
B.
Timeplus
C.
Timedelta
D.
None of the mentioned
Answer: _________
Question 60:
rolling_count function gives the number of non-null observations.
A.
True
B.
False
Answer: _________
Question 61:
Which of the following provides a standard API for doing computations with MongoDB?
A.
Blaze
B.
Geopandas
C.
FRED
D.
All of the mentioned
Answer: _________
Question 62:
Which of the following method produces a data ranking with ties being assigned the mean of the ranks for the group?
A.
rank
B.
dense_rank
C.
partition_rank
D.
none of the mentioned
Answer: _________
Question 63:
The result of an operation between unaligned Series will have the . . . . . . . . of the indexes involved.
A.
intersection
B.
union
C.
total
D.
all of the mentioned
Answer: _________
Question 64:
x-ray brings the labeled data power of pandas to the physical sciences.
A.
True
B.
False
Answer: _________
Question 65:
Which of the following works analogously to the form of the dict constructor?
A.
DataFrame.from_items
B.
DataFrame.from_records
C.
DataFrame.from_dict
D.
All of the mentioned
Answer: _________
Question 66:
Pandas follow the NumPy convention of raising an error when you try to convert something to a bool.
A.
True
B.
False
Answer: _________
Question 67:
The integer format tracks only the locations and sizes of blocks of data.
A.
True
B.
False
Answer: _________
Question 68:
Which of the following specifies the required minimum number of observations for each column pair in order to have a valid result?
A.
min_periods
B.
max_periods
C.
minimum_periods
D.
all of the mentioned
Answer: _________
Question 69:
Pandas consist of static and moving window linear and panel regression.
A.
True
B.
False
Answer: _________
Question 70:
Point out the wrong combination with regards to kind keyword for graph plotting.
A.
'scatter' for scatter plots
B.
'kde' for hexagonal bin plots
C.
'pie' for pie plots
D.
none of the mentioned
Answer: _________
Question 71:
Panel is a container for Series, and DataFrame is a container for dataFrame objects.
A.
True
B.
False
Answer: _________
Question 72:
Which of the following object has a method cov to compute covariance between series?
A.
Series
B.
DataFrame
C.
Panel
D.
None of the mentioned
Answer: _________
Question 73:
Combination of TimedeltaIndex with DatetimeIndex allow certain combination operations that are NaT preserving.
A.
True
B.
False
Answer: _________
Question 74:
Which of the following method is used for transforming a SparseSeries indexed by a MultiIndex to a scipy.sparse.coo_matrix?
A.
SparseSeries.to_coo()
B.
Series.to_coo()
C.
SparseSeries.to_cooser()
D.
None of the mentioned
Answer: _________
Question 75:
Series is a one-dimensional labeled array capable of holding any data type.
A.
True
B.
False
Answer: _________
Question 76:
cov and corr supports the optional min_periods keyword.
A.
True
B.
False
Answer: _________
Question 77:
Plots may also be adorned with error bars or tables.
A.
True
B.
False
Answer: _________
Question 78:
Spyder can introspect and display Pandas DataFrames.
A.
True
B.
False
Answer: _________
Question 79:
Which of the following operations are supported on Time Frames?
A.
idxmax
B.
ixmax
C.
ixmin
D.
none of the mentioned
Answer: _________
Question 80:
In Pandas, what method is used to remove missing values from a DataFrame?
A.
groupby()
B.
drop_duplicates()
C.
dropna()
D.
fillna()
Answer: _________
Question 81:
Which Python library is commonly used for machine learning and predictive modeling?
A.
Matplotlib
B.
Pandas
C.
Numpy
D.
Scikit-learn
Answer: _________
Question 82:
What Python library provides interactive and web-based data visualizations?
A.
Plotly
B.
Seaborn
C.
Pandas
D.
Matplotlib
Answer: _________
Question 83:
What is the primary data structure in Numpy used for storing and performing operations on numerical data?
A.
Tensor
B.
Series
C.
DataFrame
D.
ndarray
Answer: _________
Question 84:
In Pandas, what method is used to calculate summary statistics for numerical columns in a DataFrame?
A.
groupby()
B.
value_counts()
C.
describe()
D.
reshape()
Answer: _________
Question 85:
Which Python library is commonly used for data exploration and data cleaning tasks, providing interactive and user-friendly interfaces?
A.
Pandas
B.
Jupyter Notebook
C.
Seaborn
D.
None of the above
Answer: _________
Question 86:
In Pandas, which method is used to sort a DataFrame by the values in one or more columns in ascending or descending order?
A.
apply()
B.
pivot_table()
C.
sort_values()
D.
filter()
Answer: _________
Question 87:
What is the primary purpose of the Seaborn library in Python for data analysis and visualization?
A.
Machine learning
B.
Data manipulation
C.
Web development
D.
Statistical data visualization
Answer: _________
Question 88:
In Python, which library is often used for creating interactive dashboards and web applications for data analysis and visualization?
A.
Dash
B.
Numpy
C.
Matplotlib
D.
Pandas
Answer: _________
Question 89:
What Python function is used to perform element-wise mathematical operations on Numpy arrays?
A.
describe()
B.
plt.plot()
C.
df.apply()
D.
np.add()
Answer: _________
Question 90:
In Pandas, which method is used to pivot data from long to wide format, creating a new DataFrame?
A.
melt()
B.
reshape()
C.
pivot()
D.
groupby()
Answer: _________
Question 91:
Which Python library provides tools for working with structured data and is often used in data analysis tasks, especially in financial data?
A.
Seaborn
B.
PySpark
C.
Scikit-learn
D.
None of the above
Answer: _________
Question 92:
What is the primary purpose of the "melt" function in Pandas?
A.
Grouping data
B.
Aggregating data
C.
Reshaping data
D.
Sorting data
Answer: _________
Question 93:
In Python, which library is widely used for time series analysis and forecasting?
A.
Numpy
B.
Matplotlib
C.
Pandas
D.
Statsmodels
Answer: _________
Question 94:
Which Python library provides advanced data visualization capabilities and is often used for creating complex and interactive plots?
A.
Bokeh
B.
Pandas
C.
Seaborn
D.
Plotly
Answer: _________
Question 95:
What Python method is used to calculate the correlation matrix for a DataFrame, showing the relationships between numerical columns?
A.
groupby()
B.
describe()
C.
pivot_table()
D.
corr()
Answer: _________
Question 96:
In Pandas, which method is used to apply a custom function to each element of a DataFrame or Series?
A.
pivot()
B.
melt()
C.
apply()
D.
sort_values()
Answer: _________
Question 97:
Which Python library is commonly used for natural language processing (NLP) tasks in data analysis, including text mining and sentiment analysis?
A.
Seaborn
B.
NLTK
C.
Scikit-learn
D.
None of the above
Answer: _________
Question 98:
What is the primary purpose of the "crosstab" function in Pandas?
A.
Data transformation
B.
Data visualization
C.
Cross-tabulation of categorical data
D.
Data cleaning and preprocessing
E.
Data cleaning and preprocessing
F.
Data visualization
G.
Cross-tabulation of categorical data
H.
Data transformation
Answer: _________
Question 99:
In Python, which library is often used for creating animated and interactive data visualizations?
A.
SciPy
B.
Pandas
C.
Matplotlib
D.
Plotly Express
Answer: _________
Question 100:
In Pandas, which method is used to create a subset of a DataFrame by selecting rows based on a specified condition?
A.
df[df['column_name'] > value]
B.
pivot_table()
C.
melt()
D.
sort_values()
Answer: _________
Question 101:
Which Python library is commonly used for data profiling and generating summary statistics for datasets?
A.
Statsmodels
B.
Seaborn
C.
Pandas
D.
pandas-profiling
Answer: _________
Question 102:
What is the primary purpose of the "loc" method in Pandas when working with a DataFrame?
A.
Grouping and aggregating data
B.
Sorting data
C.
Selecting rows and columns
D.
Filtering data
Answer: _________
Question 103:
In Python, which library provides tools for working with databases and performing SQL queries using Pandas data structures?
A.
SQLite
B.
SQLAlchemy
C.
SQLtools
D.
None of the above
Answer: _________
Question 104:
Which Python library is commonly used for dimensionality reduction and feature selection techniques in machine learning?
A.
Scikit-learn
B.
Seaborn
C.
FeatureTools
D.
Pandas
Answer: _________
Question 105:
In Pandas, what method is used to apply a function to each element of a Series?
A.
pivot()
B.
melt()
C.
sort_values()
D.
apply()
Answer: _________
Question 106:
What is the primary purpose of the "melt" function in Pandas when reshaping data?
A.
Convert wide data to long format
B.
Sorting data
C.
Aggregating data
D.
Grouping data
Answer: _________
Question 107:
In Python, which library provides tools for working with data in Excel files, including reading, writing, and manipulation?
A.
Matplotlib
B.
Seaborn
C.
Pandas
D.
openpyxl
Answer: _________
Question 108:
Which Python library is commonly used for creating interactive data dashboards and web applications for data visualization?
A.
Pandas
B.
Seaborn
C.
Dash
D.
Bokeh
Answer: _________
Question 109:
In Pandas, which method is used to create a new column based on conditional logic applied to existing columns?
A.
melt()
B.
df['new_column'] = np.where(...)
C.
sort_values()
D.
None of the above
Answer: _________
Question 110:
The . . . . . . . . project builds on top of pandas and matplotlib to provide easy plotting of data.
A.
yhat
B.
Seaborn
C.
Vincent
D.
None of the mentioned
Answer: _________
Question 111:
Which of the following plots are used to check if a data set or time series is random?
A.
Lag
B.
Random
C.
Lead
D.
None of the mentioned
Answer: _________
Question 112:
Which of the following is a foundational exploratory visualization package for the R language in pandas ecosystem?
A.
yhat
B.
Seaborn
C.
Vincent
D.
None of the mentioned
Answer: _________
Question 113:
Quandl API for Python wraps the . . . . . . . . REST API to return Pandas DataFrames with time series indexes.
A.
Quandl
B.
PyDatastream
C.
PyData
D.
None of the mentioned
Answer: _________
Question 114:
You can create a scatter plot matrix using the . . . . . . . . method in pandas.tools.plotting.
A.
sca_matrix
B.
scatter_matrix
C.
DataFrame.plot
D.
all of the mentioned
Answer: _________
Question 115:
Which of the following takes a dict of dicts or a dict of array-like sequences and returns a DataFrame?
A.
DataFrame.from_items
B.
DataFrame.from_records
C.
DataFrame.from_dict
D.
All of the mentioned
Answer: _________
Question 116:
What library in Python is commonly used for data manipulation and analysis?
A.
Pandas
B.
Matplotlib
C.
Numpy
D.
Scikit-learn
Answer: _________
Question 117:
In Python, which module provides functions for working with dates and times?
A.
random
B.
math
C.
numpy
D.
datetime
Answer: _________
Question 118:
What is the primary data structure in Pandas used for storing and working with tabular data?
A.
Series
B.
Array
C.
DataFrame
D.
Dictionary
Answer: _________
Question 119:
Which Python library is widely used for creating data visualizations and plots?
A.
Numpy
B.
Matplotlib
C.
Scikit-learn
D.
None of the above
Answer: _________
Question 120:
What Python function is used to read a CSV (Comma-Separated Values) file into a Pandas DataFrame?
A.
df.describe()
B.
plt.savefig()
C.
pd.read_csv()
D.
np.load()
Answer: _________
Question 121:
In Python, what is the main purpose of the "groupby" operation in Pandas?
A.
Sorting data
B.
Filtering data
C.
Reshaping data
D.
Grouping and aggregating data
Answer: _________
Question 122:
Which Python library is commonly used for statistical analysis and hypothesis testing?
A.
SciPy
B.
Pandas
C.
Seaborn
D.
Plotly
Answer: _________
Question 123:
What is the primary purpose of the Matplotlib library in Python?
A.
Statistical analysis
B.
Machine learning
C.
Data manipulation
D.
Creating data visualizations
Answer: _________
Question 124:
Which Python library provides tools for working with multi-dimensional arrays and matrices?
A.
Matplotlib
B.
Pandas
C.
Numpy
D.
Scikit-learn
Answer: _________
Question 125:
What Python function is used to select specific rows and columns from a Pandas DataFrame?
A.
np.array()
B.
df.loc[]
C.
pd.read_csv()
D.
None of the above
Answer: _________
Question 126:
In Pandas, what method is used to remove duplicate rows from a DataFrame based on specified columns?
A.
dropna()
B.
fillna()
C.
drop_duplicates()
D.
groupby()
Answer: _________
Question 127:
Which Python library is commonly used for time series forecasting and anomaly detection tasks?
A.
Statsmodels
B.
Seaborn
C.
Prophet
D.
Pandas
Answer: _________
Question 128:
What is the primary purpose of the "pivot" method in Pandas?
A.
Aggregate data
B.
Sort data
C.
Group data
D.
Reshape data
Answer: _________
Question 129:
In Python, which library is often used for natural language processing (NLP) tasks such as text tokenization and part-of-speech tagging?
A.
NLTK
B.
Pandas
C.
Seaborn
D.
Scikit-learn
Answer: _________
Question 130:
Which Python library is commonly used for data visualization and exploration with a focus on producing informative statistical graphics?
A.
Matplotlib
B.
Seaborn
C.
Pandas
D.
ggplot
Answer: _________
Question 131:
In Python, which library provides tools for working with complex and hierarchical data structures, often used in scientific computing?
A.
Seaborn
B.
Xarray
C.
Dask
D.
None of the above
Answer: _________
Question 132:
Which Python library is commonly used for working with SQL databases and performing SQL-like operations on dataframes?
A.
SQLalchemy
B.
SQLite
C.
Pandas SQL
D.
Pandas
Answer: _________
Question 133:
What is the primary purpose of the "fillna" method in Pandas?
A.
Remove duplicates
B.
Group data
C.
Sort data
D.
Fill missing values
Answer: _________
Question 134:
In Python, which library is often used for deep learning tasks, including neural network construction and training, with a focus on simplicity and flexibility?
A.
Numpy
B.
Keras
C.
TensorFlow
D.
None of the above
Answer: _________
Question 135:
Point out the correct combination with regards to kind keyword for graph plotting.
A.
'hist' for histogram
B.
'box' for boxplot
C.
'area' for area plots
D.
all of the mentioned
Answer: _________
Question 136:
Using . . . . . . . . on categorical data will produce similar output to a Series or DataFrame of type string.
A.
.desc()
B.
.describe()
C.
.rank()
D.
none of the mentioned
Answer: _________
Question 137:
Which of the following is prominent python "statistics and econometrics library"?
A.
Bokeh
B.
Seaborn
C.
Statsmodels
D.
None of the mentioned
Answer: _________
Question 138:
Which of the following list-like data structure is used for managing a dynamic collection of SparseArrays?
A.
SparseList
B.
GeoList
C.
SparseSeries
D.
All of the mentioned
Answer: _________
Question 139:
Which of the following object you get after reading CSV file?
A.
DataFrame
B.
Character Vector
C.
Panel
D.
All of the mentioned
Answer: _________
Question 140:
Which of the following value is provided by kind keyword for barplot?
A.
bar
B.
kde
C.
hexbin
D.
none of the mentioned
Answer: _________
Question 141:
Which of the following library is used to retrieve and acquire statistical data and metadata disseminated in SDMX 2.1?
A.
pandaSDMX
B.
freedapi
C.
geopandas
D.
all of the mentioned
Answer: _________
Question 142:
All values of categorical data are either in categories or np.nan.
A.
True
B.
False
Answer: _________
Question 143:
Which of the following statement will import pandas?
A.
import pandas as pd
B.
import panda as py
C.
import pandaspy as pd
D.
all of the mentioned
Answer: _________
Question 144:
Which of the following can potentially change the dtype of a series?
A.
reindex_like
B.
index_like
C.
itime_like
D.
none of the mentioned
Answer: _________
Question 145:
Which of the following scalars can be converted to other 'frequencies' by as typing to a specific timedelta type?
A.
Timedelta Series
B.
TimedeltaIndex
C.
Timedelta
D.
All of the mentioned
Answer: _________
Question 146:
Which of the following is used to compute the percent change over a given number of periods?
A.
pct_change
B.
percent_change
C.
per_change
D.
none of the mentioned
Answer: _________
Question 147:
Which of the following method can be used to rename categorical data?
A.
Categorical.rename_categories()
B.
Categorical.rename()
C.
Categorical.mv_categories()
D.
None of the mentioned
Answer: _________
Question 148:
Which of the following is the base layer for all of the sparse indexed data structures?
A.
SArray
B.
SparseArray
C.
PyArray
D.
None of the mentioned
Answer: _________
Question 149:
All pandas data structures are . . . . . . . . mutable but not always . . . . . . . . mutable.
A.
size, value
B.
semantic, size
C.
value, size
D.
none of the mentioned
Answer: _________
Answer Key
1:
A
2:
D
3:
D
4:
A
5:
D
6:
C
7:
B
8:
C
9:
D
10:
C
11:
B
12:
C
13:
D
14:
C
15:
D
16:
A
17:
D
18:
D
19:
C
20:
B
21:
B
22:
A
23:
B
24:
D, E, K, N, Q, X
25:
A, E, K, O, R, W
26:
C
27:
A
28:
D
29:
A
30:
C
31:
C
32:
B
33:
A
34:
D
35:
C
36:
B
37:
C
38:
D
39:
A
40:
D
41:
C
42:
D
43:
A
44:
D
45:
C
46:
B
47:
C
48:
D
49:
A
50:
D
51:
B
52:
A
53:
D
54:
A
55:
D
56:
B
57:
A
58:
D
59:
C
60:
B
61:
A
62:
A
63:
B
64:
A
65:
A
66:
A
67:
B
68:
A
69:
A
70:
B
71:
B
72:
A
73:
A
74:
A
75:
A
76:
A
77:
A
78:
B
79:
A
80:
C
81:
D
82:
A
83:
D
84:
C
85:
A
Solution: Pandas is a popular Python library widely used for data exploration and data cleaning tasks. It provides easy-to-use, flexible, and powerful data structures, especially DataFrames, which allow for efficient data manipulation and analysis. With Pandas, users can perform operations such as filtering, merging, grouping, and reshaping data quickly and effectively. It is commonly used in conjunction with other libraries like NumPy and Matplotlib to extend its data handling capabilities and support data visualization.
86:
C
87:
D
88:
A
89:
D
90:
C
91:
B
92:
C
93:
D
94:
A
95:
D
96:
C
97:
B
98:
C, G
99:
D
100:
A
101:
D
102:
C
103:
B
104:
C
105:
D
106:
A
107:
D
108:
C
109:
B
110:
B
111:
A
112:
A
113:
A
114:
B
115:
A
116:
A
117:
D
118:
C
119:
B
120:
C
121:
D
122:
A
123:
D
124:
C
125:
B
126:
C
127:
C
128:
D
129:
A
130:
D
131:
B
132:
C
133:
D
134:
B
135:
D
136:
B
137:
C
138:
A
139:
A
140:
A
141:
A
142:
A
143:
A
144:
A
145:
D
146:
A
147:
A
148:
B
149:
C