Question : get duplicate and remove but keep last in python df
Answered by : sachin-verma
drop_duplicates(self, subset=None, keep="last", inplace=False)
Source : | Last Update : Sun, 27 Sep 20
Question : pandas drop duplicate keep last
Answered by : acm-c
df = df.sort_values('timestamp').drop_duplicates(['customer_id','var_name'], keep='last')
print (df) customer_id value var_name timestamp
0 1 1 apple 2018-03-22 00:00:00.000
3 1 1 orange 2018-03-22 08:00:00.000
2 2 4 apple 2018-03-24 08:00:00.000
4 2 3 orange 2018-03-24 08:00:00.000
Source : https://stackoverflow.com/questions/49425727/how-do-i-drop-duplicates-and-keep-the-last-timestamp-on-pandas | Last Update : Sat, 30 Jul 22
Question : pandas drop duplicates but keep most recent date
Answered by : rich-ray-2zv3o6ebizt9
df.sort_values('DATE_CHANGED').drop_duplicates('STATION_ID',keep='last')
Source : https://stackoverflow.com/questions/52395820/drop-duplicates-keep-most-recent-date-pandas-dataframe | Last Update : Mon, 28 Feb 22