StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Thiago Baldim

Rating
1475.28 (4,510,621st)
Reputation
3,446 (48,938th)
Page: 1 2 3
Title Δ
Spark: Convert hex string to Decimal 0.00
Why is parallel aggregation not faster in spark? 0.00
Databricks Committed_vacuum in AWS S3 0.00
Loading large data with pandas directly from hdfs with pyspark 0.00
Difference in Spark SQL Shuffle partitions +0.05
spark partition strategy comparison between date=dd-mm-yyyy vs yyyy... 0.00
Apache Spark: count vs head(1).isEmpty -2.03
Executor heartbeat timed Out : Error in Spark Job 0.00
Spark - is map function available for Dataframe or just RDD? 0.00
Can Apache Spark be used in place of Sqoop 0.00
Spark executors,partitions out of memory 0.00
What is the purpose of data types in (Py)Spark? 0.00
hadoop copying the result from hdfs to S3 0.00
what is driver memory and executor memory in spark? 0.00
Which situation is it better to use coalesce vs repartition 0.00
Is there any way to optimise this code that uses pandas to read a T... 0.00
Kafka Partition ordering guarantee 0.00
Kafka Streams KTable to Stream INVALID_TOPIC_EXCEPTION 0.00
jackson/guava jar conflict when run spark on yarn 0.00
How do I run Spark jobs concurrently in the same AWS EMR cluster ? +0.53
Couchbase Java DCP client doesn't start the load from all bucke... 0.00
Relevance of Hadoop & Streaming solutions when Spark exists -1.58
reduceByKey and lambda 0.00
Pull data from RDS MySQL db using pyspark 0.00
Should I Avoid groupby() in Dataset/Dataframe? 0.00
Memory configurations 0.00
Spark Submit Configuration while running parallel jobs in EMR 0.00
spark, scala & jdbc - how to limit number of records +2.77
how does apache spark process the non-rdd likes System.out, for, wh... 0.00
Initializing SparkContext inside another SparkContext object +4.07
Is there a reference of Spark Log4j properties? 0.00
Spark job fails when cluster size is large, succeeds when small -3.66
Pyspark Memory Issue 0.00
Spark EMR S3 Processing Large No of Files 0.00
Processing json much slower than csv with multiple cores 0.00
Kafka Structured Streaming error 0.00
Will a Spark job write to or read from the local file system? 0.00
Exception with Table identified via AWS Glue Crawler and stored in... 0.00
Increase or decrease partitions for an aggregation? 0.00
How should I set parameters "spark.kryoserializer.buffer.mb&qu... 0.00
How to use the same spark context in a loop in Pyspark 0.00
how to use pyspark to read orc file 0.00
Is garbage collection time part of execution time of a task in apac... 0.00
Doubts related to Spark resource usage 0.00
Apache Kafka + Spark Integration (REST API is needed?) 0.00
1 Billion records join(Filters) in Spark with Parquet file format v... 0.00
Loading a spark dataframe into Hive partition 0.00
Consume a big data by Kafka and Spark 0.00
Spark in memory when using SparkSession with enableHiveSupport 0.00
string manipulation for column names in pyspark 0.00