How to do Sql Merge statement on pandas data frame in python

Question

aditi_13 0 Newbie Poster

3 Years Ago

I am new to python dataframes so please help me do a merge on pandas dataframe.

df1

custid   custname    email              phone
 x         tina     z.gmail.com        345-345-3456
 y         mina     z1.gmail.com       445-345-3456
 z         zina     z2.gmail.com       555-345-3456
 q         pina                        233-456-3456

df2

custid   custname    email              phone
 x         tina     z.gmail.com        345-345-3456
 y         xina     z1.gmail.com       445-345-3456
 k         tina     tina@gmail.com    703-234-3456
 q         pina     pina@gmail.com    233-456-3456

I want the desired output.

Insert update and delete .Update should happen if the value in not null in df1.If it is null in df1 and have value in df2 then don't update.

df3

custid   custname    email              phone                    Action
 x         tina     z.gmail.com        345-345-3456               None
 y         mina     z1.gmail.com       445-345-3456               Update
 z         zina     z2.gmail.com       555-345-3456               Insert
 k         tina      tina@gmail.com    703-234-3456               Delete
 q         pina     pina@gmail.com     233-456-3456                 None

python

4 Contributors
4 Replies
316 Views
1 Day Discussion Span
Latest Post 3 Years Ago Latest Post by bboycage

bboycage 16 Newbie Poster

3 Years Ago

df3 = pd.concat([df1, df2])
df3 = df3[df3["email"].notnull()].drop_duplicates(subset = ["email"])

This code produces what you need

rproffitt commented: Excellent. My thought was to keep df1 as the complete result. Now the OP can pick and choose! +16

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

Excellent. My thought was to keep df1 as the complete result. Now the OP can pick and choose!

aditi_13 0 Newbie Poster · Answer 1 · 2021-08-29T15:59:03+00:00

aditi_13 0 Newbie Poster

3 Years Ago

Please any help?

Dani 4,558 The Queen of DaniWeb Administrator Featured Poster Premium Member · Answer 2 · 2021-08-30T17:55:07+00:00

Sorry, unfortunately I don't know python so I can't help you. I'm going to upvote your post so hopefully someone comes along who can help.

rproffitt 2,701 https://5calls.org Moderator · Answer 3 · 2021-08-30T21:27:50+00:00

I am not deeply conversant with this area. Anyhow, I think the merge is one line of code. Psuedo code only so you can rewrite to match your needs.

I think I'd do the merge first before working on the duplicates.

One line? Try:

df1.append(df2)

Try that and see what happens. To clean up duplicates consider:

df1.drop_duplicates(subset=['custname'], keep='last')

Remember that this is just my untested thoughts here and you can tailor as you see fit.

PS. I see there is a merge feature but it's an area I have yet to explore. Try finding tutorials on this area.