Негізгі бет Capgemini Data Engineer Interview Question - Round 1 | Save Multiple Columns in the DataFrame |

Күн бұрын

Capgemini Data Engineer Interview Question - Round 1 | Save Multiple Columns in the DataFrame |

Рет қаралды 15,044

GeekCoders

Жүктеу

Пікірлер: 27

@kunuturuaravindreddy5879
2 ай бұрын
very good you are posting real interview questions many of them simply explain concer defentitiins
@GeekCoders
2 ай бұрын
@@kunuturuaravindreddy5879 thanks
@sourav_sarkar_2000
8 ай бұрын
# creating a dict of columns as to avoid checking multiple datatypes d={} for col in df.dtypes: if col[1] not in d: d[col[1]] = [col[0]] else:d[col[1]].append(col[0]) for key,val in d.items(): df.select(val).show() # write df to the location
@aamirmansuri69
9 ай бұрын
Thank you for posting this video. But, can you please post pyspark interview questions for freshers. Thank you!
@myl1566
9 ай бұрын
Good problem to solve. Thanks for posting sagar!
@GeekCoders
9 ай бұрын
Thank you
@sourav_sarkar_2000
8 ай бұрын
# creating a dict of columns to avoid checking multiple datatypes d={} for col in df.dtypes: if col[1] not in d: d[col[1]] = [col[0]] else:d[col[1]].append(col[0]) print(d) for key,val in d.items(): df.select(val).show() # write df to the location # df.write.mode('overwrite').save(f'temp_loc/{key}')
@Offical_PicturePerfect
2 ай бұрын
int_cols = [col for col, dtype in df.dtypes if dtype == 'int'] string_cols = [col for col, dtype in df.dtypes if dtype == 'string'] float_cols = [col for col, dtype in df.dtypes if dtype == 'float'] Creating DataFrames for each data type int_df = df.select(int_cols) string_df = df.select(string_cols) float_df = df.select(float_cols)
@Dataengineeringlearninghub
9 ай бұрын
Great problem sagar
@rawat7203
9 ай бұрын
My Way Sir intType = [] stringType = [] floatType = [] for i in df.dtypes: if i[1] == 'int': intType.append(i[0]) elif i[1] == 'string': stringType.append(i[0]) elif i[1] == 'float': floatType.append(i[0]) dfInt = df.select(*intType) dfString = df.select(*stringType) dfFloat = df.select(*floatType)
@GeekCoders
9 ай бұрын
Nice
@vutv5742
9 ай бұрын
Completed 👏
@rawat7203
9 ай бұрын
Thanks a lot Sir
@GeekCoders
8 ай бұрын
Thank you
@Nextgentrick
8 ай бұрын
Shouldn’t you use append instead of overwrite
@ug1880
8 ай бұрын
Were u asked for any imocha test ?
@GeekCoders
8 ай бұрын
No
@ug1880
8 ай бұрын
@@GeekCoders okk...
@pradishpranam6175
8 ай бұрын
cool question
@bhumikalalchandani321
9 ай бұрын
okay, is this internal functionality of conversion to parq format
@rawat7203
9 ай бұрын
yes
@pratyushkumar8567
9 ай бұрын
Hi Sagar this Capgemini Data Engineer Interview Question - Round 1 | Save Multiple Columns in the DataFrame what was the experience the candidate has ?
@GeekCoders
9 ай бұрын
4 years
@2412_Sujoy_Das
9 ай бұрын
My solution is as follows: string = df integer = df float = df for i in df.dtypes: if i[1]!='string' and i[1]=='int': string = string.drop(i[0]) float = float.drop(i[0]) elif i[1]!='string' and i[1]=='float': string = string.drop(i[0]) integer = integer.drop(i[0]) elif i[1]!='int' and i[1]=='string': integer = integer.drop(i[0]) float = float.drop(i[0]) elif i[1]!='int' and i[1]=='float': integer = integer.drop(i[0]) string = string.drop(i[0]) elif i[1]!='float' and i[1]=='string': float = float.drop(i[0]) integer = integer.drop(i[0]) else: float = float.drop(i[0]) string = string.drop(i[0]) print(string) print(integer) print(float)
@SouvikMitul
5 ай бұрын
my solution: dict={} for i in df.dtypes: if i[1] in dict.keys(): l=dict.get(i[1]) l.append(i[0]) dict.update({i[1]:l}) else: l=[] l.append(i[0]) dict.update({i[1]:l}) for i in dict.keys(): df_s=df.select(dict.get(i)) df_s.show() ##did show instead of writing

Tiger Analytics PySpark Interview Question | Very Important Question of PySpark |

Deloitte Interview Question | Find missing Numbers in the DataFrame |

龟兔赛跑：好可爱的小乌龟#short #angel #clown

哈哈大家为了进去也是想尽办法！#火影忍者 #佐助 #家庭

Life hack 😂 Watermelon magic box! #shorts by Leisi Crazy

Сюрприз для Златы на день рождения

TCS Live Interview for Azure Data Engineer | Technical round -1 Azure | KSR DATAVIZON

Most Important Question of PySpark in LTIMindTree Interview Question | Salary in each department |

Validate data between source and target table | Data Engineering Interview | BIG 4 | PWC

Interview: Amazon Data Engineer (Majoring in Computer Science to working as Data Engineer)

Uber Data Engineer Interview: Design a Ride Sharing Schema

Top 10+ Data Engineer Interview Questions and Answers

Solve KPMG Pyspark Interview Questions

Capgemini Interview Experience | Python/Web Developer - GIT/AWS Interview Question Answers | SDE

Processing 25GB of data in Spark | How many Executors and how much Memory per Executor is required.

Azure Databricks Interview Questions And Answers | Azure Databricks Interview | Intellipaat

龟兔赛跑：好可爱的小乌龟#short #angel #clown

Capgemini Data Engineer Interview Question - Round 1 | Save Multiple Columns in the DataFrame |

Пікірлер: 27