Found 507 Articles for Pandas

How to check if any value is NaN in a Pandas DataFrame?

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 11:49:21

781 Views

To check if any value is NaN in a Pandas DataFrame, we can use isnull().values.any() method.StepsMake a series, s, one-dimensional ndarray with axis labels (including time series).Print the series, s.Check whether NaN is present or not.Create a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Check whether NaN is present or not.Example Live Demoimport pandas as pd import numpy as np s = pd.Series([1, np.nan, 3, np.nan, 3, np.nan, 7, np.nan, 3]) print "Input series is:", s present = s.isnull().values.any() print "NAN is present in series: ", present df = pd.DataFrame(    {       "x": [5, ... Read More

How to reset hierarchical index in Pandas?

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 11:47:12

272 Views

To reset hierarchical index in Pandas, we can use reset_index() method.StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Use groupby to get different levels of a hierarchical index and count it.Print multi-hierarchical index DataFrame.Reset the multi-hierarchical index DataFrame, using df.reset_index().Print the new updated DataFrame.Example Live Demoimport pandas as pd df = pd.DataFrame({"x": [5, 2, 1, 9], "y": [4, 1, 5, 10]}) print "Input DataFrame is:", df df1 = df.groupby(["x", "y"]).count() print "Hierarchical Index of input DataFrame is:", df1 df2 = df1.reset_index() print "After resetting: ", df2OutputInput DataFrame is:    x  y 0  5   ... Read More

How to make a multi-index in Pandas?

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 10:11:46

244 Views

To make a multi-index in Pandas, we can use groupby with list of columns.StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Print the index of DataFrame count.Use groupby to get different levels of a hierarchical index and count it.Print the mulitindex set in step 4.Example Live Demoimport pandas as pd df = pd.DataFrame(    {       "x": [5, 2, 1, 9],       "y": [4, 1, 5, 10],       "z": [4, 1, 5, 0]    } ) print "Input DataFrame is:", df print "Default index: ", df.count().index df1 = df.groupby(["x", "y"]).count() ... Read More

Convert a Pandas DataFrame to a NumPy array

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 10:02:39

596 Views

To convert a Pandas DataFrame to a NumPy array, we can use to_numpy().StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Print the NumPy array of the given array, using df.to_numpy().Print the NumPy array of the given array for a specific column, using df['x'].to_numpy().Example Live Demoimport pandas as pd df = pd.DataFrame(    {       "x": [5, 2, 1, 9],       "y": [4, 1, 5, 10],       "z": [4, 1, 5, 0]    } ) print "Input DataFrame is:", df print "DataFrame to numpy is:", df.to_numpy() print "DataFrame to numpy is:", df['x'].to_numpy()OutputInput ... Read More

How to count the NaN values in a column in a Python Pandas DataFrame?

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:57:44

2K+ Views

To count the NaN values in a column in a Pandas DataFrame, we can use the isna() method with sum.StepsCreate a series, s, one-dimensional ndarray with axis labels (including time series).Print the series, s.Count the number of NaN present in the series.Create a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Find NaN count column wise.Print the count DataFrame.Example Live Demoimport pandas as pd import numpy as np s = pd.Series([1, np.nan, 3, np.nan, 3, np.nan, 7, np.nan, 3]) print "Input series is:", s count = s.isna().sum() print "NAN count in series: ", count df = pd.DataFrame(   ... Read More

Deleting a DataFrame row in Python Pandas based on column value

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:45:30

3K+ Views

To delete a DataFrame row in Pandas based on column value, we can take the following Steps −StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Here, we will delete the row from the DataFrame that contains 0 in its Z-column, using df=df[df.z != 0]Print the updated DataFrame, after deleting row based on column value.Example Live Demoimport pandas as pd df = pd.DataFrame(    {       "x": [5, 2, 1, 9],       "y": [4, 1, 5, 10],       "z": [4, 1, 5, 0]    } ) print "Input DataFrame is:", df df ... Read More

How are iloc and loc different in Python Pandas?

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:42:42

238 Views

Let's take an example to understand the difference between iloc and loc. Basically loc[0] returns the value present at 0 index, whereas iloc[0] returns the value present at the first location of a series.StepsCreate a one-dimensional ndarray with axis labels (including time series).Print the input series.Use loc[0] to print the value present at 0th index.Use iloc[0] to print the value present at the first location of the series table.Example Live Demoimport pandas as pd s = pd.Series(list("AEIOU"), index=[2, 1, 0, 5, 8]) print "Input series is:", s print "Value at index=0:", s.loc[0] print "Value at the 1st location of the series:", ... Read More

Writing a Pandas DataFrame to CSV file

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:39:48

4K+ Views

To write a Pandas DataFrame to CSV file, we can take the following Steps −StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Use df.to_csv to save the values of the DataFrame to a CSV (comma-separated values) file.Example Live Demoimport pandas as pd df = pd.DataFrame(    {       "x": [5, 2, 1, 9],       "y": [4, 1, 5, 10],       "z": [4, 1, 5, 0]    } ) print "Input DataFrame is:", df df.to_csv("test.csv", sep='\t')OutputInput DataFrame is:    x   y  z 0  5  4  4 1  2  1  1 2  1  5  5 3  9 10  0It will create a new file ("test.csv") and save the values of the DataFrame in it.

Use a list of values to select rows from a Pandas DataFrame

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:36:20

2K+ Views

To select the rows from a Pandas DataFrame based on input values, we can use the isin() method.StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Create a list of values for selection of rows.Print the selected rows with the given values.Next, print the rows that were not selected.Example Live Demoimport pandas as pd df = pd.DataFrame(    {       "x": [5, 2, 1, 9],       "y": [4, 1, 5, 10],       "z": [4, 1, 5, 0]    } ) print "Input DataFrame:", df values = [1, 2] print "Selected Rows:", ... Read More

Create a Pandas Dataframe by appending one row at a time

Rishikesh Kumar Rishi
Updated on 30-Aug-2021 09:34:34

3K+ Views

To create a Pandas DataFrame by appending one row at a time, we can iterate in a range and add multiple columns data in it.StepsCreate a two-dimensional, size-mutable, potentially heterogeneous tabular data, df.Print the input DataFrame.Iterate in a range of 10.Assign values at different index with numbers.Print the created DataFrame.Example Live Demoimport pandas as pd import random df = pd.DataFrame(    {       "x": [],       "y": [],       "z": []    } ) print "Input DataFrame:", df for i in range(10):    df.loc[i] = [i, random.randint(1, 10), random.randint(1, 10)] print "After ... Read More

Advertisements