StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Ramzy

Rating
1487.33 (4,451,460th)
Reputation
4,330 (38,594th)
Page: 1 2 3 ... 6
Title Δ
How spark manages IO perfomnce if we reduce the number of cores per... 0.00
Spark Structured Streaming Print Offsets Per Batch Per Executor 0.00
Read from Hbase + Convert to DF + Run SQLs 0.00
Most efficient way to select and process data from a dataframe 0.00
Spark Streaming - Restarting from checkpoint replays last batch +0.52
HBase schema design correct? +2.07
Structured Streaming Kafka Source Offset Storage 0.00
Spark : How to make calls to database using foreachPartition +0.02
For Hbase, is there a function like EXPLAIN in MySQL? 0.00
spark-submit how to specify that dependent libraries are inside app... +0.02
Save a RDD by saveAsObject, Exception "had a not serializable... 0.00
Spark-submit cannot access local file system -0.53
Pyspark read multiple csv files into a dataframe (OR RDD?) 0.00
Reading massive JSON files into Spark Dataframe -0.48
Using pig to store data to Hbase 0.00
Spark 1.6.3 rdd.foreach with a broadcast variable cost too much time -0.23
I am trying to understand given log generated by Spark Program 0.00
Hbase composite key and aggregated rows 0.00
Delete value with HBase-Hive integration 0.00
Best practice to run multiple spark instance at a time in same jvm? +0.52
Records processed metric for intermediate datasets 0.00
Spark Streaming : source HBase 0.00
Scala 2.11 Spark 2.0 hortonworks-spark/shc sbt assemby +0.05
Spark scala coding standards 0.00
For distributing calculation task, which is better celery or spark +0.02
Setting Driver manually in Spark Submit over Yarn Cluster +0.01
Is it necessary to set partition number everywhere? spark 0.00
Is there any concept of auto commit in hbase? -0.49
Spark DataFrame map error 0.00
Split Spark Dataframe to each row and convert to JSON - Python 0.00
Can I use groupByKey in each partition of RDD? or how can I find th... 0.00
Hbase for real-time application 0.00
Which HBase connector for Spark 2.0 should I use? -0.48
Last Reducer is running from last 24 hour for 200 gb of data set -0.42
Spark-Scala connection 0.00
For spark applications running on YARN, which deploy mode is better... +0.52
More efficient way to loop through PySpark DataFrame and create new... -0.44
Spark Health check scripts 0.00
What is an efficient way to partition by column but maintain a fixe... +0.02
Reduce computational time in Spark application 0.00
How to tune spark executor number, cores and executor memory? 0.00
Spark: some general best practices for generic "out of memory&... -0.51
Standalone Cluster Mode: how does spark allocate spark.executor.cor... -0.01
Spark + Yarn: How to retain logs of lost-executors 0.00
Is it inefficient to manually iterate Spark SQL data frames and cre... 0.00
(Spark/Scala) What would be the most effective way to compare speci... 0.00
Monitoring Apache Spark Logs and the Dynamic App/Driver logs 0.00
Spark - HiveContext | Wrong Timestamps (substracts 4 hours) 0.00
How to convert PythonRDD (of lines in JSONs) to DataFrame? -0.21
Convert Xml to Avro from Kafka to hdfs via spark streaming or flume 0.00