redshift_connector Driver throws error while running SELECT 1 (or any query) #20137

niravpeak · 2022-05-20T07:20:23Z

A clear and concise description of what the bug is.
We were using driver redshift+psycopg2 , that worked well so far. As part of enhanced security we moved from that driver to redshift+redshift_connector driver. Although it does successful connect & dropdown of tables. it is unable to display correct dataset & getting mentioned error below:

How to reproduce the bug

install redshift_connector
bootstrapScript: |
#!/bin/bash
rm -rf /var/lib/apt/lists/* &&
pip install
psycopg2-binary==2.9.1
redis==3.5.3
sqlalchemy-redshift==0.8.9
redshift-connector==2.0.907 &&
if [ ! -f ~/bootstrap ]; then echo "Running Superset with uid {{ .Values.runAsUser }}" > ~/bootstrap; fi
Go to 'databases' => Add database using username / password or IAM based. test connect
Go to sql editor page
select database & relevant schema.
On editor type "SELECT 1"
See below error:
First element of field tuple is neither a tuple nor str

Expected results

Output with data 1
what you expected to happen.

Actual results

what actually happens.
On pod we are getting below error trace:
raceback (most recent call last):
File "/app/superset/views/base.py", line 207, in wraps
return f(self, *args, **kwargs)
File "/app/superset/utils/log.py", line 245, in wrapper
value = f(*args, **kwargs)
File "/app/superset/views/core.py", line 2393, in sql_json
command_result: CommandResult = command.run()
File "/app/superset/sqllab/command.py", line 104, in run
raise ex
File "/app/superset/sqllab/command.py", line 96, in run
status = self._run_sql_json_exec_from_scratch()
File "/app/superset/sqllab/command.py", line 138, in _run_sql_json_exec_from_scratch
raise ex
File "/app/superset/sqllab/command.py", line 133, in _run_sql_json_exec_from_scratch
return self._sql_json_executor.execute(
File "/app/superset/sqllab/sql_json_executer.py", line 111, in execute
raise SupersetErrorsException(
superset.exceptions.SupersetErrorsException: [SupersetError(message='First element of field tuple is neither a tuple nor str', error_type=<SupersetErrorType.GENERIC_DB_ENGINE_ERROR: 'GENERIC_DB_ENGINE_ERROR'>, level=<ErrorLevel.ERROR: 'error'>, extra={'engine_name': 'Amazon Redshift', 'issue_codes': [{'code': 1002, 'message': 'Issue 1002 - The database returned an unexpected error.'}]})]

Screenshots

If applicable, add screenshots to help explain your problem.

Environment

(please complete the following information):

browser type and version: chrome:
superset version: superset version: Superset 0.0.0dev
python version: python --version: Python 3.8.12
node.js version: node -v:
any feature flags active: [ installed with helm chart ]
helm install superset . --values=values.yaml -n superset-experiment

Checklist

Make sure to follow these steps before submitting your issue - thank you!

I have checked the superset logs for python stacktraces and included it here as text if there are any.
I have reproduced the issue with at least the latest released version of superset.
I have checked the issue tracker for the same issue and I haven't found one similar.

Additional context

Add any other context about the problem here.
Few more details about debug trace from redshift_connector can be seen as below:
2022-05-17 05:50:12,363:DEBUG:redshift_connector:===================================
2022-05-17 05:50:12,363:DEBUG:redshift_connector.cursor:Cursor.paramstyle=named
2022-05-17 05:50:12,363:DEBUG:redshift_connector.core:===================================
2022-05-17 05:50:12,363:DEBUG:redshift_connector.core:Establishing a connection
2022-05-17 05:50:12,363:DEBUG:redshift_connector.core:{'user': 'IAM:uksegmentexplorer', 'database': 'dev', 'application_name': 'sqlalchemy-redshift', 'replication': None, 'client_protocol_version': '2', 'driver_version': 'Redshift Python Driver 2.0.907', 'os_version': 'Linux-5.4.181-99.354.amzn2.x86_64-x86_64-with-glibc2.2.5'}
2022-05-17 05:50:12,363:DEBUG:redshift_connector.core:===================================
2022-05-17 05:50:12,369:DEBUG:redshift_connector.cursor:Cursor.paramstyle=format
2022-05-17 05:50:12,369:DEBUG:redshift_connector.core:Sending start-up message
2022-05-17 05:50:12,551:DEBUG:redshift_connector.core:Server indicated EXTENDED_RESULT_METADATA transfer protocol will be used rather than protocol requested by client: BINARY
2022-05-17 05:50:12,551:DEBUG:redshift_connector.cursor:Cursor.paramstyle=format
2022-05-17 05:50:12,559:DEBUG:redshift_connector.core:field count=1

The text was updated successfully, but these errors were encountered:

rusackas · 2022-05-25T15:23:47Z

@eschutho do you think this change in driver is something we should look at and support?

Brooke-white · 2022-06-07T17:25:49Z

Hi @rusackas & @eschutho , I maintain redshift_connector. We haven't tested integration with superset, so I'm new to the superset codebase. Is this issue something you folks have seen in the past?

I'm working on reproducing this issue locally and will update any more details I find here.

eschutho · 2022-06-08T21:32:29Z

psycopg2 is the default driver for redshift on Superset, so it's definitely likely that we would need to make some small adjustments to use the redshift_connector driver. If we were to support it natively, I believe right now we would have to create a new db engine spec for redshift+redshift_connector. @betodealmeida do you have any additional insight on this?

Brooke-white · 2022-06-08T21:46:33Z

please let me know if there is anything needed from redshift_connector to support this

betodealmeida · 2022-06-09T22:42:12Z

@Brooke-white the cursor description returned by theredshift_connector has the column name as bytes, and we expect it to be a string. For example:

[
    (b"date", 1043, None, None, None, None, None),
    (b"open", 701, None, None, None, None, None),
    (b"high", 701, None, None, None, None, None),
    (b"low", 701, None, None, None, None, None),
    (b"close", 701, None, None, None, None, None),
    (b"adj close", 701, None, None, None, None, None),
    (b"volume", 20, None, None, None, None, None),
]

There's no standard type for the column name (https://peps.python.org/pep-0249/#cursor-attributes), but in all DB API 2.0 drivers I've seen it's returned as a string, so you might want to change that. I'll also update Superset so that it works with bytes, just in case.

Brooke-white · 2022-06-14T16:28:36Z

Thanks for the heads up @betodealmeida -- we are working with @niravpeak to determine what a migration path looks like for retrieving column names as strings

eschutho · 2022-06-24T23:59:52Z

I'm going to close this ticket given @betodealmeida's fix, but feel free to comment if you feel that action was premature.

niravpeak added the #bug Bug report label May 20, 2022

betodealmeida self-assigned this Jun 9, 2022

betodealmeida mentioned this issue Jun 9, 2022

fix: ensure column name in description is string #20340

Merged

9 tasks

niravpeak mentioned this issue Jun 13, 2022

Unable to respond with SQLAlchemy driver / Superset code base aws/amazon-redshift-python-driver#105

Closed

eschutho closed this as completed Jun 24, 2022

Paul-office mentioned this issue Sep 2, 2022

Redshift Connector driver issue (2.0.908) aws/amazon-redshift-python-driver#126

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

redshift_connector Driver throws error while running SELECT 1 (or any query) #20137

redshift_connector Driver throws error while running SELECT 1 (or any query) #20137

niravpeak commented May 20, 2022

rusackas commented May 25, 2022

Brooke-white commented Jun 7, 2022

eschutho commented Jun 8, 2022

Brooke-white commented Jun 8, 2022

betodealmeida commented Jun 9, 2022 •

edited

Loading

Brooke-white commented Jun 14, 2022

eschutho commented Jun 24, 2022

redshift_connector Driver throws error while running SELECT 1 (or any query) #20137

redshift_connector Driver throws error while running SELECT 1 (or any query) #20137

Comments

niravpeak commented May 20, 2022

How to reproduce the bug

Expected results

Actual results

Screenshots

Environment

Checklist

Additional context

rusackas commented May 25, 2022

Brooke-white commented Jun 7, 2022

eschutho commented Jun 8, 2022

Brooke-white commented Jun 8, 2022

betodealmeida commented Jun 9, 2022 • edited Loading

Brooke-white commented Jun 14, 2022

eschutho commented Jun 24, 2022

betodealmeida commented Jun 9, 2022 •

edited

Loading