Hi @JohnAziz
Thanks a lot for sharing this interesting walkthrough.
On my side, I realized a POC with the integration of data coming from a website. The goal being to chat and ask question in regards to the content of this website.
The flow is the following
- For each URL of the website
- store the url name in a csv file (1.csv to xxx.csv)
- extract the content of the page and integrate it in the same csv file
- send the csv file to Azure Blob Container
- Run the indexer of azure search
I integrate more than 7000 text files
The vectorization works very well and the chat produces good result
But in the citation the content is not really good and the title of the citation takes the name of the text file (xxx.csv) which is not the url of the original page.
According to your experience, is there a way to add a new field to the azure search index with the URL name ?
Thanks a lot for your answer.
cheers