Unable to decompose nested JSON in Spark dataframe
I am new to Spark. I'm trying to flatten a DataFrame, but I can't get it to work with explode.
The original data frame structure is as follows:
id|approvaljson
1|[{"approvertype":"1st line manager","status":"approved"},{"approvertype":"2nd line manager","status":"approved"}]
2|[{"approvertype":"1st line manager","status":"approved"},{"approvertype":"2nd line manager","status":"rejected"}]
I need to convert it to the following schema:
id|approvaltype|status
1|1st line manager|approved
1|2nd line manager|approved
2|1st line manager|approved
2|2nd line manager|rejected
I tried this:
from pyspark.sql.functions import explode
df_exploded = df.withColumn("approvaljson", explode("approvaljson"))
But I got the error:
Cannot resolve "explode(ApprovalJSON)" due to data type mismatch: parameter 1 requires ("ARRAY" or "MAP") type, however, "ApprovalJSON" is of "STRING" type.;
Correct answer
The column holds a JSON string, not an array, which is why explode rejects it. First parse the string into an array of structs with from_json, then use inline to expand the array into rows and columns:
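from pyspark.sql import functions as F

# Parse the JSON string into array<struct<approvertype, status>>
df1 = df.withColumn("approvaljson", F.from_json("approvaljson", schema="array<struct<approvertype string, status string>>"))
# inline() emits one row per array element and one column per struct field
df1 = df1.select("id", F.inline("approvaljson"))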
Result:
df1.show()
+---+----------------+--------+
| ID|    ApproverType|  Status|
+---+----------------+--------+
|  1|1st Line Manager|Approved|
|  1|2nd Line Manager|Approved|
|  2|1st Line Manager|Approved|
|  2|2nd Line Manager|Rejected|
+---+----------------+--------+