StackRating

An Elo-based rating system for Stack Overflow
Rating Stats for Zhang Tong

Rating: 1509.81 (71,984th)
Reputation: 2,133 (78,935th)
Title Δ
Reduce PythonRDD of lists into one list -1.22
str.decode() does not work for Chinese characters in pyspark 0.00
ERROR:SparkContext can only be used on the driver, not in code that... 0.00
Write Rows in Spark Dataframe to Separate Directories in HDFS (pysp... 0.00
How to process files with file names in records from Kafka using Ja... -4.01
how to use pyspark saveAsTextFile deal Chinese characters 0.00
Process data per group in Hive with PySpark 0.00
Spark: how to parse empty string values as null in json 0.00
Spark Accumulator confusion 0.00
Aggregating List of Dicts in Spark DataFrame 0.00
How to reference an aliased aggregate function applied to a column... 0.00
Pyspark appcache too big 0.00
ReduceByKey Function - Spark Python 0.00
Extracting specific lines from a Large file +3.39
Spark program is not printing Hive Database or Table list -0.33
Apache Spark custom aggregation function 0.00
Spark error for python program "java.lang.OutOfMemoryError: Ja... 0.00
How to remove missing values in Pyspark +0.44
Spark 2.0 - Flatten JSON file to a CSV 0.00
Writing to a file from inside worker in spark local mode doesn'... -1.72
How do I add a new column to a Spark DataFrame from function value... 0.00
How to add a python list to spark dataframe? -0.01
python udf to calculate julian date from julian day 0.00
spark - compare key with values -4.08
Controlling Spark Streaming of the Files -0.34
pyspark Change the value of a column before using groupby on that c... +3.87
What happens when Spark reads multiple parquet files which differ i... 0.00
pyspark streaming restore from checkpoint 0.00
Possible to get output from Spark App submitted in cluster mode? 0.00
PySpark: use one column to index another (udf of two columns?) 0.00
How can i convert a timestamp to gmt format in hive +3.81
How do I preprocess JSON data before loading into Spark dataframe 0.00
Count Duplicates Values within a time interval in PySpark 0.00
Spark Installation and Configuration on MacOS ImportError: No modul... -0.13
PySpark speed Ubuntu vs Windows 0.00
Remove element from PySpark DataFrame column 0.00
Creating a new rdd from another rdd in Python -0.10
Querying json object in dataframe using Pyspark 0.00
Load an RDD into hive 0.00
RDD creation and variable binding +4.03
How can I catch the log output of pyspark foreachPartition? 0.00
How to append to a csv file using df.write.csv in pyspark? +3.94
What is use of "spark.streaming.blockInterval" in Spark S... 0.00
Pyspark rdd Transpose 0.00
SPARK Steaming updateStateByKey values from previous Microbatch whe... 0.00
Pyspark 'PipelinedRDD' object has no attribute 'show' 0.00
normal RDD in spark streaming and add or remove data into this RDD 0.00
How to read concurrently from each Kafka partition in Spark Streami... +0.31
% wordcount in SPARK STREAMING (PYTHON) 0.00
How can I connect to 2 kafka topics at a time, but process only 1 a... +0.01