Question

Exercise 3: Remove rows with missing data You need to use the dropna() method of a dataframe. More info here: https:/pandas p
0 0
Add a comment Improve this question Transcribed image text
Answer #1

##in question the dataset is not provided, so creating some

##random data

##In case of any issues please comment, i will definitely help you.

##please rate +ve :)

import numpy as np

import pandas as pd

##in question the original dataframe is not given,

##assuming salaries is the dataframe name

##below is dummy data set

data=[[1,'Maths',10,5,'M',20000],[2,'Maths',10,5,'M',1000],[1,np.nan,10,5,'M',20000],[1,'Maths',10,5,np.nan,20000]]

salaries = pd.DataFrame(data, columns=['rank','discipline','yrs.since.phd','yrs.service','sex','salaey'])

##below is the only code you require.

salaries_nonull=salaries.dropna()

salaries_nonull.count()


##below is the only code you require. salaries_nonull-salaries.dropna() salaries_nonull.count() rank discipline yrs.since.phd

Note: Just last two lines the answer for your question. i have done on dummy data as original data is not provided in the question

Add a comment
Know the answer?
Add Answer to:
Exercise 3: Remove rows with missing data You need to use the dropna() method of a...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • Exercise 2: Summarize the data Get some quick stats with the describe() method. - In [6]:...

    Exercise 2: Summarize the data Get some quick stats with the describe() method. - In [6]: # Your code here Out[6] salary yrs since.phd yrs.service 399.000000 399.000000 count 400.000000 22.233083 12.909360 17.601504 113538 660000 12.987009 30253.387593 0.000000 57800.000000 7.000000 91000.000000 1.000000 25% 12 000000 50% 21.000000 16.000000 107175.000000 75% 32.000000 56 000000 26.500000 134046250000 60.000000 231545 000000 ma describe only gives you stats for quantitative values Let's find out how many observations we have for the rank and sex variables....

  • (a) Load the data file data/tips.csv into a pandas DataFrame called tips_df using the pandas read_table()...

    (a) Load the data file data/tips.csv into a pandas DataFrame called tips_df using the pandas read_table() function. Check the first five rows. (b) Create a new dataframe called tips by randomly sampling 6 records from the dataframe tips_df. Refer to the sample() function documentation. (c) Add a new column to tips called idx as a list ['one', 'two', 'three', 'four', 'five', 'six'] and then later assign it as the index of tips dataframe. Display the dataframe. (d) Create a new...

  • Lab Exercise #15 Assignment Overview This lab exercise provides practice with Pandas data analysis library. Data...

    Lab Exercise #15 Assignment Overview This lab exercise provides practice with Pandas data analysis library. Data Files We provide three comma-separated-value file, scores.csv , college_scorecard.csv, and mpg.csv. The first file is list of a few students and their exam grades. The second file includes data from 1996 through 2016 for all undergraduate degree-granting institutions of higher education. The data about the institution will help the students to make decision about the institution for their higher education such as student completion,...

  • Hi it's python I imported a data which are so many words in txt and I arranged and reshaped with ...

    Hi it's python I imported a data which are so many words in txt and I arranged and reshaped with alphabetically both rows and columns I was successful with these steps but I am stuck with next step below is my code and screenshot import numpy as np import pandas as pd data=pd.read_csv("/Users/superman/Downloads/words_file2.txt",header=None) df_input=pd.DataFrame(data) df_output=pd.DataFrame(np.arange(676).reshape((26,26)), index = ['a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'], columns = ['a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z']) df_output.index.name="Start" df_output.columns.name="End" df_output This below screen shot is what I have to find I have to find each word...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT