StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for Oli

Rating: 1536.77 (13,523rd)
Reputation: 1,308 (122,957th)
Title Δ
Generate multiple outputs from a dataframe, reading data only once 0.00
How to map a column to create a new column in spark sql dataframe? 0.00
How to transpose many files Given that linesWithFileNames: RDD[(Pat... 0.00
How to apply the minDF feature extractor CountVectorizer parameter... 0.00
Get first example element from filtered aggregation pySpark +2.08
Find maximum average using Dataframe API in spark +0.09
Spark 2.3: Reading dataframe inside rdd.map() 0.00
How to apply conditional expression filter on spark dataframe where... 0.00
How to add new Column in pyspark and insert multiple values with ba... 0.00
is it possible to take a single dataframe row and split it up to mu... 0.00
Spark not running RDD in Parallel Pyspark with binaryfile 0.00
Populate dataset with missing dates (in days) with scala +0.08
Pyspark: Unable to turn RDD into DataFrame due to data type str ins... 0.00
fetch year, month, day from string PySpark 0.00
apache spark graphx - create VertexRDD from sql table 0.00
Want to transpose the columns in spark scala 0.00
Read spark csv with empty values without converting to null 0.00
Scala/Spark - Convert Word2vec output to Dataset[_] -0.05
efficient symmertic computation in spark 0.00
Spark window partition function taking forever to complete +1.23
Spark window partition function taking forever to complete -0.27
Spark Scala input empty values according result from self joined da... 0.00
hive sql join different table as array of structs 0.00
Spark 2 converting scala array to WrappedArray 0.00
Spark collect_list change data_type from array to string +0.45
Spark IllegalArgumentException: Column features must be of type str... 0.00
Spark : need confirmation on approach in capturing first and last d... 0.00
While Reading CSV, last column is coming as Null in Spark, Scala -0.04
Why do I get a type mismatch error when using a UDF that returns an... 0.00
Compare rows of an array column with the headers of another data fr... 0.00
Pyspark: Split a single column with multiple values into separate c... 0.00
How to efficiently join large pyspark dataframes and small python l... 0.00
Creating a Pyspark data frame with variable schema +0.45
Difference between approxCountDsitinct and approx_count_distinct in... 0.00
pyspark do calculation for each row against other rows and get max 0.00
error: value orderBy is not a member of org.apache.spark.sql.Relati... 0.00
Spark DataFrame: Add a new columns according to other columns 0.00
Reading gzipped parquet files from spark 0.00
Finding Percentile in Spark-Scala per a group +0.45
Spark SQL: get the value of a column when another column is max val... +1.19
IllegalArgumentException: Column must be of type struct<type:tin... +2.35
How do I efficiently map keys from one dataset based on values from... 0.00
Pyspark - How use a function with two arguments taken from my rdd 0.00
Scala; recursively walk all directories in parent Hadoop directory +0.45
Pivot in spark scala -0.55
How can I efficiently join a dataframe in spark with a directory of... 0.00
Custom output file format write with Spark +0.46
Quotes not displayed in CSV output file -0.03
how to apply flatMapToPair on a given rdd? 0.00
Locality Sensitive Hashing in Spark for single DataFrame +0.46