pablo
|
50a56091b9
|
Added missing logger import.
|
2021-01-06 10:38:30 +01:00 |
|
Pablo Martin
|
3740ab2ada
|
Merge branch 'fix/dead_ad_string' into 'integration'
Fix/dead ad string
See merge request pablomartincalvo/Drogon!2
|
2021-01-04 21:29:52 +00:00 |
|
pablo
|
639de7c602
|
Change strings to look for in HTML. Chores.
|
2021-01-04 22:29:01 +01:00 |
|
Pablo Martin
|
6122f74e99
|
Merge branch 'refactor/capturer_improved' into 'integration'
Refactor/capturer improved
See merge request pablomartincalvo/Drogon!1
|
2021-01-04 21:23:16 +00:00 |
|
pablo
|
cbf1643fb5
|
Formatting, docstrings and other chores.
|
2021-01-04 22:17:40 +01:00 |
|
pablo
|
adf2cd26ba
|
Minor fix regarding issue spotting in parsing.
|
2021-01-04 21:56:24 +01:00 |
|
pablo
|
cf4ce06b57
|
Implemented tests for CapturingTask. A few mock classes were needed.
|
2021-01-03 20:06:28 +01:00 |
|
pablo
|
007f458cd5
|
Minor fixes.
|
2021-01-03 20:05:34 +01:00 |
|
pablo
|
e34a34acaf
|
Fix in throttling test so it doesn't fail around midnight.
|
2021-01-02 23:49:10 +01:00 |
|
pablo
|
def858ef6a
|
Modified input format of instructions for ParsingFlowGenerator. Previous dict wouldn't allow for more than one SecondaryFeaturesFieldInstructions class pointer.
|
2020-12-31 19:02:09 +01:00 |
|
pablo
|
2b249063e0
|
Created a new flow generator + tests for it.
|
2020-12-31 18:28:48 +01:00 |
|
pablo
|
b8d4893026
|
Mini syntax fix.
|
2020-12-31 18:14:44 +01:00 |
|
pablo
|
cb553b5f7e
|
Minor fixes in parsing utils.
|
2020-12-29 20:42:21 +01:00 |
|
pablo
|
3b79ba06d8
|
Created parsing_utils module to refactor HTML parsing and validation actions.
|
2020-12-29 17:38:17 +01:00 |
|
pablo
|
3f9a6d8e53
|
Integrated throttling in capturer.
|
2020-12-27 12:35:02 +01:00 |
|
pablo
|
d136144a4e
|
Throttling checks are now lazy.
|
2020-12-26 20:25:56 +01:00 |
|
pablo
|
2a9483981e
|
Implemented a new throttling module to remove redundancy in the project.
|
2020-12-26 18:54:04 +01:00 |
|
pablo
|
f207dd5dda
|
Started integration branch.
|
2020-12-26 12:12:44 +01:00 |
|
pablo
|
0086cf2b4c
|
Improved logging in refresher.py
|
2020-11-15 13:21:08 +01:00 |
|
pablo
|
e939d67467
|
Improvements in listing page URL generation.
|
2020-11-15 12:54:17 +01:00 |
|
pablo
|
a61fac72f7
|
Typing, docstrings and formatting of explorer.py
|
2020-11-03 21:55:09 +01:00 |
|
pablo
|
f53a65834b
|
Turned method static.
|
2020-11-03 14:00:51 +01:00 |
|
pablo
|
43236c2884
|
Typing, docstrings, formatting for capturer.py
|
2020-11-03 13:50:36 +01:00 |
|
pablo
|
3cf7dd8bd9
|
Typing, docstrings, formatting for mysql_wrapper.py
|
2020-11-03 08:44:37 +01:00 |
|
pablo
|
e9ee23f852
|
Typing, docstrings, formatting for scrapping_utils.py
|
2020-11-03 07:43:21 +01:00 |
|
pablo
|
a79fc533ee
|
Formatting.
|
2020-11-03 07:29:17 +01:00 |
|
pablo
|
cd9c3b6e39
|
Some changes.
|
2020-11-03 07:26:06 +01:00 |
|
pablo
|
9e7194c8d9
|
URLAttacks now share a common session.
|
2020-11-02 13:08:37 +01:00 |
|
pablo
|
db04a67c4c
|
More testing code.
|
2020-11-02 12:51:20 +01:00 |
|
pablo
|
c337a33feb
|
More testing code.
|
2020-11-02 12:43:49 +01:00 |
|
pablo
|
81112a4cb9
|
More testing code.
|
2020-11-02 12:02:56 +01:00 |
|
pablo
|
51c4bdb347
|
Fixes. Code version for mysql.
|
2020-05-08 09:26:29 +02:00 |
|
pablo
|
596aaa1393
|
.
|
2020-05-05 11:36:28 +02:00 |
|
pablo
|
8d4c082a18
|
Format.
|
2020-04-26 15:06:04 +02:00 |
|
pablo
|
923649a099
|
Format. Random headers
|
2020-04-26 15:05:40 +02:00 |
|
pablo
|
af11a2e87f
|
Weird waiting distribution implemented
|
2020-04-26 14:54:27 +02:00 |
|
pablo
|
df032328e9
|
Formatting and todos.
|
2020-04-25 18:26:22 +02:00 |
|
pablo
|
f0fe2b9780
|
Updated headers.
|
2020-04-25 18:17:43 +02:00 |
|
pablo
|
c8ea77e99a
|
Logging in explorer.py
|
2020-03-26 11:47:12 +01:00 |
|
pablo
|
acfeeef0d1
|
Formatting
|
2020-03-26 11:38:08 +01:00 |
|
pablo
|
9c2565f5d8
|
Logging and formatting
|
2020-03-26 11:37:32 +01:00 |
|
pablo
|
cdbb6b5325
|
Added logging to geocoder.py
|
2020-03-26 11:30:09 +01:00 |
|
pablo
|
a9242b2f3a
|
Added logging config.
|
2020-03-26 11:18:14 +01:00 |
|
pablomartincalvo
|
812bb66219
|
Refactor in geocoder to avoid an error with responses that contain no results.
|
2019-02-09 17:25:27 +01:00 |
|
pablomartincalvo
|
5ec97ad008
|
Small refactorings.
|
2019-01-07 18:09:52 +01:00 |
|
pablomartincalvo
|
227f298d8b
|
Removed unnecessary prints from the capturer.
|
2019-01-03 19:32:40 +01:00 |
|
pablomartincalvo
|
38984822a7
|
Updated headers of the URL attacks due to changes in idealsita.
|
2018-12-30 19:28:05 +01:00 |
|
pablomartincalvo
|
ed32b15bc1
|
Changes in parser validation
|
2018-12-30 12:06:23 +01:00 |
|
pablomartincalvo
|
98165ce8f0
|
Changes in parser validation
|
2018-12-29 11:37:43 +01:00 |
|
pablomartincalvo
|
368f8a00bb
|
Merge branch 'dev'
|
2018-12-25 18:54:06 +01:00 |
|