Skip to content

Update load_csv.py#229

Open
ryan-mangeno wants to merge 2 commits into
talkdai:mainfrom
ryan-mangeno:patch-1
Open

Update load_csv.py#229
ryan-mangeno wants to merge 2 commits into
talkdai:mainfrom
ryan-mangeno:patch-1

Conversation

@ryan-mangeno

Copy link
Copy Markdown

Redundant CSV Loading: The main issue with the original approach is that the CSV is being loaded twice — once for metadata_columns and once for NECESSARY_COLS. This leads to repeated IO operations and potential inefficiency. Since the columns are being merged in the same function, you can load the CSV once and manage the metadata combination more efficiently.

Instead of I loaded all metadata columns in one with loader_metadata = CSVLoader(path, metadata_columns=NECESSARY_COLS + metadata_columns)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant