在 Pandas DataFrame 上创建视图

很多时候，在进行数据分析时，我们正在处理一个具有很多属性的大型数据集。所有属性不一定同等重要。因此，我们只想使用数据框中的一组列。为此，让我们看看我们如何在 Dataframe 上创建视图并仅选择我们需要的那些列并保留其余列。

有关代码中使用的 CSV 文件的链接，请单击此处。

解决方案 #1：可以通过删除所有不需要的列来选择 DataFrame 中的一组列。

# importing pandas as pd
import pandas as pd
  
# Reading the csv file
df = pd.read_csv("nba.csv")
  
# Print the dataframe
print(df)

输出：

现在我们将选择除前三列之外的所有列。

# drop the first three columns
df.drop(df.columns[[0, 1, 2]], axis = 1)

输出：

我们还可以使用要删除的列的名称。

# drop the 'Name', 'Team' and 'Number' columns
df.drop(['Name', 'Team', 'Number'], axis = 1)

输出：

解决方案#2：我们可以单独选择所有我们需要的列，而忽略其余的列。

# importing pandas as pd
import pandas as pd
  
# Reading the csv file
df = pd.read_csv("nba.csv")
  
# select the first three columns
# and store the result in a new dataframe
df_copy = df.iloc[:, 0:3]
  
# Print the new DataFrame
df_copy

输出：

我们还可以通过将列表传递给DataFrame.iloc属性以随机方式选择列。

# select the first, third and sixth columns
# and store the result in a new dataframe
# The numbering of columns begins from 0
df_copy = df.iloc[:, [0, 2, 5]]
  
# Print the new DataFrame
df_copy

输出：

或者，我们也可以命名我们想要选择的列。

# Select the below listed columns
df_copy = df[['Name', 'Number', 'College']]
  
# Print the new DataFrame
df_copy

输出：