StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for Avishek Bhattacharya

Rating: 1501.27 (403,371st)
Reputation: 2,742 (61,579th)
Title Δ
Spark and Tika for pdf parsing 0.00
Spark Dataframe complex ordering 0.00
Spark count dataframe to estimate output partitions, then write, ef... 0.00
Killing oozie coordinator is not killing the subsequent spark job 0.00
Deduplicate Spark Dataframe by Field -0.58
Union two DataFrame using spark 2.x with different schema/dataTypes 0.00
Task not serializable - Java 1.8 and Spark 2.1.1 +0.48
Spark: efficiency of dataframe checkpoint vs. explicitly writing to... 0.00
Write Partition with Date column Java-Spark +0.49
How to handle this obscure error when doing a join in spark? 0.00
Where to set "spark.yarn.executor.memoryOverhead" 0.00
How to pass multiple column in partitionby method in Spark 0.00
apache spark executors and data locality 0.00
why is .write.partitionBy().sortBy().saveAsTable() producing a much... 0.00
drop table command is not deleting path of hive table which was cre... 0.00
Dataproc master node configuration 0.00
Spark: Data can't fit in memory and I want to avoid write it in... 0.00
I am facing issues in running a code in RStudio via Spark "Err... 0.00
Create Spark SQL tables from multiple parquet paths +0.49
Spark 2.3 Dropping Temp Table 0.00
Does Spark saveAsTable roll back when the session is killed during... 0.00
Best practice for writing to hadoop from spark +0.47
spark.executor.extraJavaOptions ignored in spark-submit 0.00
Apache Spark page results or view results on large datasets +0.48
Comparing DataFrames in Spark 0.00
how to tune up spark Jobs on a cluster with different amount of mem... +0.50
Joining a large and a massive spark dataframe 0.00
Coalesce reducing JDBC read parallelism -0.48
spark error:java.lang.IllegalArgumentException: Size exceeds Intege... +4.01
How does RDD coalesce work 0.00
Spark: 'Requested array size exceeds VM limit' when writing... 0.00
Encounter SparkException "Cannot broadcast the table that is l... +4.05
Spark Geolocated Points Clustering -3.99
Spark: does DataFrameWriter have to be a blocking step? 0.00
What is the benefit of using nested data types in Parquet? +3.70
Why SPARK repeat transformations after persist operations? +3.50
EMR Spark job using less executors than nodes in the cluster 0.00
Should we create separate dataframe for each table in a join query... -0.09
Order of Growth Analysis +4.67
What to use to read/write from dynamodb from Spark? 0.00
Convert date format in Scala +0.01
Is my understanding of spark partitioning correct? 0.00
Strategy pattern vs Inheritance -1.47
Why spark creates empty partitions and how default partitioning work? 0.00
Spark' Dataset unpersist behaviour 0.00
how to get the big O (worst case) recursive nCr 0.00
Rename and Move S3 files based on their folders name in spark scala -3.74
Is there a size limit for Spark's RDD 0.00
Divide operation in spark using RDD or dataframe 0.00
Too many tasks in spark on Yarn 0.00