@Generated(value="software.amazon.awssdk:codegen") public final class Urls extends Object implements SdkPojo, Serializable, ToCopyableBuilder<Urls.Builder,Urls>
Provides the configuration information of the URLs to crawl.
You can only crawl websites that use the secure communication protocol, Hypertext Transfer Protocol Secure (HTTPS). If you receive an error when crawling a website, it could be that the website is blocked from crawling.
When selecting websites to index, you must adhere to the Amazon Acceptable Use Policy and all other Amazon terms. Remember that you must only use Amazon Kendra Web Crawler to index your own web pages, or web pages that you have authorization to index.
| Modifier and Type | Class and Description |
|---|---|
static interface |
Urls.Builder |
| Modifier and Type | Method and Description |
|---|---|
static Urls.Builder |
builder() |
boolean |
equals(Object obj) |
boolean |
equalsBySdkFields(Object obj) |
<T> Optional<T> |
getValueForField(String fieldName,
Class<T> clazz) |
int |
hashCode() |
List<SdkField<?>> |
sdkFields() |
SeedUrlConfiguration |
seedUrlConfiguration()
Configuration of the seed or starting point URLs of the websites you want to crawl.
|
static Class<? extends Urls.Builder> |
serializableBuilderClass() |
SiteMapsConfiguration |
siteMapsConfiguration()
Configuration of the sitemap URLs of the websites you want to crawl.
|
Urls.Builder |
toBuilder() |
String |
toString()
Returns a string representation of this object.
|
clone, finalize, getClass, notify, notifyAll, wait, wait, waitcopypublic final SeedUrlConfiguration seedUrlConfiguration()
Configuration of the seed or starting point URLs of the websites you want to crawl.
You can choose to crawl only the website host names, or the website host names with subdomains, or the website host names with subdomains and other domains that the web pages link to.
You can list up to 100 seed URLs.
You can choose to crawl only the website host names, or the website host names with subdomains, or the website host names with subdomains and other domains that the web pages link to.
You can list up to 100 seed URLs.
public final SiteMapsConfiguration siteMapsConfiguration()
Configuration of the sitemap URLs of the websites you want to crawl.
Only URLs belonging to the same website host names are crawled. You can list up to three sitemap URLs.
Only URLs belonging to the same website host names are crawled. You can list up to three sitemap URLs.
public Urls.Builder toBuilder()
toBuilder in interface ToCopyableBuilder<Urls.Builder,Urls>public static Urls.Builder builder()
public static Class<? extends Urls.Builder> serializableBuilderClass()
public final boolean equalsBySdkFields(Object obj)
equalsBySdkFields in interface SdkPojopublic final String toString()
Copyright © 2023. All rights reserved.