Skip to content

Column does not exist in BatchData when column name has a dot (.) - Spark only #11199

@larruda

Description

@larruda

Describe the bug
I'm using a SparkFileSystemDatasource with a CSVAsset (header=True). The csv file has a header line and some column names have dots (e.g. Data.Entrega). GX is not able to identify those columns and throw "The column "Data.Entrega" in BatchData does not exist"

To Reproduce
It's an ephemeral context, also I'm not allowed to share any code due to NDA with customer. It's simple as I described, reading a csv asset from a spark datasource with header=true and dots in the col names, simple as that anyone should be able to easily reproduce.

Expected behavior
The column should be found.

Environment (please complete the following information):

  • Operating System: MacOS
  • Great Expectations Version: 1.4.5
  • Data Source: Spark CSV file
  • Cloud environment: N/A

Additional context
Add any other context about the problem here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugBugs bugs bugs!

    Type

    No type

    Projects

    Status

    To Do

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions