- Newest
- Most votes
- Most comments
Hello.
I think it is possible to use a web crawler by configuring RAG by combining Amazon Kendra and Bedrock.
https://aws.amazon.com/jp/blogs/machine-learning/index-your-web-crawled-content-using-the-new-web-crawler-for-amazon-kendra/
https://diptimanrc.medium.com/amazon-kendra-web-crawler-content-search-for-internet-and-intranet-401325c8b4ed
https://community.aws/content/2eBsTWhvFFPUtzh2secQlNRBgta/prototype-a-rag-chatbot-with-amazon-bedrock-kendra-and-lex
Another approach is to use Opensearch serverless and Bedrock (without Knowledge Base) : https://github.com/aws-samples/rag-using-langchain-amazon-bedrock-and-opensearch
You can consider to use web crawler as Riku_kobayasi said but before choosing an approach, consider factors like the size and complexity of your website, the frequency of updates to the content, and your budget for hosting and computation costs. Additionally, make sure to comply with any legal or ethical considerations related to web scraping and data usage.
Relevant content
- asked 4 years ago
- Accepted Answerasked 7 months ago
- asked 3 months ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 6 months ago
- AWS OFFICIALUpdated a month ago
- AWS OFFICIALUpdated a year ago
The blog that we create are like 1 per week.