hugom123 commited on
Commit
d38cd2f
·
verified ·
1 Parent(s): b311bed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -450
README.md CHANGED
@@ -1,469 +1,42 @@
1
  ---
 
2
  tags:
3
  - sentence-transformers
4
  - sentence-similarity
 
 
5
  - feature-extraction
6
- - generated_from_trainer
7
- - dataset_size:29026
8
- - loss:CosineSimilarityLoss
9
- widget:
10
- - source_sentence: 'Central Bankers, Bureaucratic Incentives, and Monetary Policy:
11
- an Introduction'
12
- sentences:
13
- - The idea that institutional structures, or property rights, affect individual
14
- behavior is not a new one. In fact, this idea has been the focus of much of the
15
- public choice/property rights literature of the past two decades. What is relatively
16
- new is the application of this principle to central bank behavior. Until recently,
17
- economists have tended to analyze central banks in a Utopian framework. The implicit
18
- assumption has been that central banks conduct their operations to maximize the
19
- public interest. Any deviation from the optimal rate of inflation could be attributed
20
- to technical problems associated with unanticipated changes in the money multiplier
21
- or money demand. Because of the assumptions employed, the economist’s role is
22
- that of a technician, one who identifies the precise relationships between the
23
- monetary base, the money supply, and the resulting inflation rate.
24
- - This paper examines empirical evidence of the adaptive learning behavior of speculators
25
- in the 2005–2008 renminbi appreciation episode, and establishes a theoretical
26
- model to explore appreciation policy implications of such a behavior. In our model,
27
- speculators form their expectations about the future appreciation premium adaptively
28
- by extrapolating past appreciation returns into the future. We find that a rapid
29
- appreciation may attract more capital inflows, and the central bank may prefer
30
- a slow appreciation to discourage capital inflows. Simulated results can generate
31
- hump-shaped paths of the appreciation speed, expected appreciation premium, and
32
- capital inflows. In addition, changes in the appreciation speed precede changes
33
- in the expected appreciation premium and capital inflows. These results are consistent
34
- with empirical evidence in the 2005–2008 renminbi appreciation episode.
35
- - This paper makes stochastic projections of the Central Bank of Chile’s (CBCh)
36
- balance sheet (stocks and flows) starting from the actual current negative net
37
- worth. These projections incorporate the effect on the balance sheet of several
38
- macroeconomic variables as well as alternative policy decisions, taking into account
39
- the uncertainty and risks inherent in the economy. The article describes and assesses
40
- the main causes of the present deficit. In the baseline scenario, the deterministic
41
- forecast shows that the Central Bank's net worth will increase slowly to reach
42
- positive values after 25 years. However, in the stochastic simulations the Bank's
43
- net worth will still be negative 25 years from now, with a 69 percent probability.
44
- - source_sentence: New Forms of the Management to Loan Size and the Consequences--A
45
- Factor Analysis of Money Expanding from 1996 to the Second Quarter of 2004
46
- sentences:
47
- - When the nominal return on all public liabilities is allowed to adjust to changing
48
- market conditions, or the central bank has access to unlimited open market operations,
49
- money growth is likely to stimulate output. This is shown in the model used by
50
- Lucas in his Nobel Prize Lecture as an example of the non neutral effects of anticipated
51
- monetary expansions. A rise in net outside assets increases households' incentives
52
- to work through a reallocation of consumption across periods. This result survives
53
- with non interest-bearing cash when the latter does not generate relevant distortions.
54
- - central bank is insolvent if its plans imply a Ponzi scheme on reserves so the
55
- price level becomes infinity. If the central bank enjoys fiscal support, in the
56
- form of a dividend rule that pays out net income every period, including when
57
- it is negative, it can never become insolvent independently of the fiscal authority.
58
- Otherwise, this note distinguishes between intertemporal insolvency, rule insolvency,
59
- and period insolvency. While period and rule solvency depend on analyzing dividend
60
- rules and sources of risk to net income, evaluating intertemporal solvency requires
61
- overcoming the difficult challenge of measuring the present value of seignorage.
62
- - This study argues that financializations is not a phenomenon exclusively associated
63
- with complex innovation in highly developed financial markets. Financialization
64
- also affects countries with 'shallow' financial markets but with a significant
65
- presence of transnational financial actors that become a powerful economic and
66
- political force able to navigate and shape uneven regulatory and institutional
67
- terrains in order to sustain new modes of profit generation. The study distinguishes
68
- two stages in financialization of the Romanian economy. The first, central-bank
69
- dominated stage saw the systemic transformation of firm-bank-state relations that
70
- resulted in the creation of impatient banking and the forestalling of industrial
71
- policy options. The second, dependent financialization, is in turn characterised
72
- by new modes of profit generation for transnational financial actors, interconnectedness
73
- and fragility as the main mechanism of incorporation in European financial structures.
74
- - source_sentence: Central Bank Participation in Currency Options Markets
75
- sentences:
76
- - Abstract This paper investigates the impact of the central bank's liquidity operations
77
- on the financial constraints of the bank-dependent firms. We use the Reserve Bank
78
- of India's liquidity operation called Term Repo Operation (TRO) in the study.
79
- The empirical analysis is based on a large scale firm-level data for the period
80
- 2011–2016 and panel logit estimation method. Our findings indicate that the financial
81
- constraints of the bank-dependent firms have reduced than their counterparts since
82
- the introduction of the operation. We also show that larger firms reap significant
83
- benefit out of TRO than the smaller firms.
84
- - Since 1990, Poland has adopted nearly all possible exchange rate regimes, moving
85
- smoothly from fixed peg to pure floating. At the beginning of the transition period
86
- exchange rate policy was aimed mainly at stabilising the economy. It supported
87
- monetary policy in its fight against inflation, playing the role of a nominal
88
- anchor. Under the stabilisation plan, the Polish authorities decided to fix the
89
- zloty to the US dollar at the level of 9,500 zlotys to one US$ in January 1990.
90
- In that way the government and the central bank delivered a strong nominal anchor
91
- for an economy facing dramatic and abrupt adjustments in the real sector and in
92
- relative prices. At the same time, the zloty became an internally convertible
93
- currency.
94
- - This paper models unemployment as a general equilibrium solution in labor and
95
- capital markets, while the natural rate hypothesis explains unemployment simply
96
- as a partial equilibrium in the labor market. It is shown that monetary policy
97
- can have long-run effects by affecting required returns on capital and investment.
98
- If monetary policy is primarily concerned with maintaining price stability, the
99
- interaction between wage bargaining and the central bank’s credibility as an inflation
100
- fighter becomes a crucial factor in determining employment. Different labor market
101
- institutions condition different monetary policy reactions. With centralized wage
102
- bargaining, a central bank mandate focusing primarily on price stability is sufficient.
103
- With an atomistic labor market, the central bank must also consider output as
104
- a policy objective.
105
- - source_sentence: 'The politics of the European Central Bank: principal-agent theory
106
- and the democratic deficit'
107
- sentences:
108
- - ABSTRACTThe stress test of the European Central Bank has become one of the primary
109
- regulatory tools for the European banking system. In order to make such a regulatory
110
- indicator, different national...
111
- - This paper uses the VAR methodology to analyse the effects of European Central
112
- Bank monetary policy shocks and euro area output and inflation shocks on the European
113
- Union member states from Central and Eastern Europe. First, we look at the strength
114
- of effects of identified euro area monetary policy shocks and compare the influence
115
- with the one of the domestic policy shocks. Next, we turn to analysis of output
116
- and inflation responses to euro area output and inflation shocks relative to aggregate
117
- euro area reaction. We provide implications for each country monetary policy decision
118
- process and draw conclusions concerning readiness for euro adoption, both from
119
- monetary and real economy point of view.
120
- - The "conservative central banker" has come under attack recently. On the basis
121
- of models in which there is explicit interaction between trade union behavior
122
- and monetary policy, it has been argued that if 'trade unions' are averse to inflation,
123
- welfare will be lower with a conservative than with a liberal central bank. We
124
- reframe this discussion in a standard trade union model. We show that the case
125
- against the conservative central banker rests exclusively on the assumption of
126
- a strictly nominal outside option (for instance, unemployment benefits) for the
127
- union. There is no welfare gain associated with making the central bank less conservative
128
- than society, however, if the outside option is in real terms. As the nominal
129
- components of the trade union's outside option are mainly public transfers, we
130
- also show that the conservative central banker is always optimal if the government
131
- can choose the level of nominal unemployment benefits as well as the degree of
132
- central bank conservatism.
133
- - source_sentence: Global Liquidity and Monetary Policy Autonomy
134
- sentences:
135
- - In the 1970s monetarism was accepted as a principle of policy of central banks.
136
- However monetary growth targets currently play no official role in the monetary
137
- policy of industrial countries. The majority of monetary authorities excluded
138
- money from their analysis. Inflation, GDP and unemployment are not determined
139
- by money supply in their models. This article provides a historical perspective
140
- on the development and apparent failure of monetarism as a policy guide.
141
- - Deep financial crisis which started in 2007 proved to be extremely contagious
142
- affecting the financial and banking EU system. Achieving an integrated banking
143
- market is the main component of the European policy in the financial-banking services
144
- area. The latest developments underlined that the difficulties faced by the banks
145
- can negatively impact on the entire financial stability of the member states.
146
- That's why, the European Central Bank will be entitled to supervise any bank of
147
- the euro area, especially the ones that benefit of public support. Reforming the
148
- financial and banking system must be shaped in the frame of insuring some durable
149
- national finances, of an urgent recapitalizing of the banks that need that and
150
- of elaborating some common fiscal and financial and banking regulations effective
151
- in the eurozone
152
- - We have confirmed that the cause of Japan’s recession has been the sharp reduction
153
- in credit creation that began in 1992 and was triggered by the bad debts in the
154
- banking system. We have also found that this was due to excessive loan growth
155
- quotas imposed on the banks by the Bank of Japan during the 1980s. Finally, we
156
- found that the problem of lack of credit during the 1990s could easily have been
157
- solved through monetary policy. Bad debts could have been taken off the banks’
158
- balance sheets without costs by the central bank. Even without bank lending, the
159
- central bank could have created a recovery a decade ago, by significantly increasing
160
- its own credit creation. In other words, Japan’s recession of the 1990s has been
161
- the result of the Bank of Japan’s policies.2
162
- pipeline_tag: sentence-similarity
163
  library_name: sentence-transformers
 
 
 
 
 
 
164
  ---
165
 
166
- # SentenceTransformer
167
-
168
- This is a [sentence-transformers](https://www.SBERT.net) model trained. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
169
-
170
- ## Model Details
171
-
172
- ### Model Description
173
- - **Model Type:** Sentence Transformer
174
- <!-- - **Base model:** [Unknown](https://huggingface.co/unknown) -->
175
- - **Maximum Sequence Length:** 300 tokens
176
- - **Output Dimensionality:** 1024 dimensions
177
- - **Similarity Function:** Cosine Similarity
178
- <!-- - **Training Dataset:** Unknown -->
179
- <!-- - **Language:** Unknown -->
180
- <!-- - **License:** Unknown -->
181
-
182
- ### Model Sources
183
-
184
- - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
185
- - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
186
- - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
187
-
188
- ### Full Model Architecture
189
-
190
- ```
191
- SentenceTransformer(
192
- (0): Transformer({'max_seq_length': 300, 'do_lower_case': True}) with Transformer model: BertModel
193
- (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
194
- (2): Normalize()
195
- )
196
- ```
197
-
198
- ## Usage
199
-
200
- ### Direct Usage (Sentence Transformers)
201
-
202
- First install the Sentence Transformers library:
203
-
204
- ```bash
205
- pip install -U sentence-transformers
206
- ```
207
-
208
- Then you can load this model and run inference.
209
- ```python
210
- from sentence_transformers import SentenceTransformer
211
-
212
- # Download from the 🤗 Hub
213
- model = SentenceTransformer("sentence_transformers_model_id")
214
- # Run inference
215
- sentences = [
216
- 'Global Liquidity and Monetary Policy Autonomy',
217
- 'In the 1970s monetarism was accepted as a principle of policy of central banks. However monetary growth targets currently play no official role in the monetary policy of industrial countries. The majority of monetary authorities excluded money from their analysis. Inflation, GDP and unemployment are not determined by money supply in their models. This article provides a historical perspective on the development and apparent failure of monetarism as a policy guide.',
218
- "Deep financial crisis which started in 2007 proved to be extremely contagious affecting the financial and banking EU system. Achieving an integrated banking market is the main component of the European policy in the financial-banking services area. The latest developments underlined that the difficulties faced by the banks can negatively impact on the entire financial stability of the member states. That's why, the European Central Bank will be entitled to supervise any bank of the euro area, especially the ones that benefit of public support. Reforming the financial and banking system must be shaped in the frame of insuring some durable national finances, of an urgent recapitalizing of the banks that need that and of elaborating some common fiscal and financial and banking regulations effective in the eurozone",
219
- ]
220
- embeddings = model.encode(sentences)
221
- print(embeddings.shape)
222
- # [3, 1024]
223
-
224
- # Get the similarity scores for the embeddings
225
- similarities = model.similarity(embeddings, embeddings)
226
- print(similarities.shape)
227
- # [3, 3]
228
- ```
229
-
230
- <!--
231
- ### Direct Usage (Transformers)
232
-
233
- <details><summary>Click to see the direct usage in Transformers</summary>
234
-
235
- </details>
236
- -->
237
-
238
- <!--
239
- ### Downstream Usage (Sentence Transformers)
240
-
241
- You can finetune this model on your own dataset.
242
-
243
- <details><summary>Click to expand</summary>
244
-
245
- </details>
246
- -->
247
-
248
- <!--
249
- ### Out-of-Scope Use
250
-
251
- *List how the model may foreseeably be misused and address what users ought not to do with the model.*
252
- -->
253
-
254
- <!--
255
- ## Bias, Risks and Limitations
256
-
257
- *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
258
- -->
259
-
260
- <!--
261
- ### Recommendations
262
-
263
- *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
264
- -->
265
-
266
- ## Training Details
267
-
268
- ### Training Dataset
269
-
270
- #### Unnamed Dataset
271
-
272
- * Size: 29,026 training samples
273
- * Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>label</code>
274
- * Approximate statistics based on the first 1000 samples:
275
- | | sentence_0 | sentence_1 | label |
276
- |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:---------------------------------------------------------------|
277
- | type | string | string | float |
278
- | details | <ul><li>min: 6 tokens</li><li>mean: 15.09 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 134.34 tokens</li><li>max: 288 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.48</li><li>max: 1.0</li></ul> |
279
- * Samples:
280
- | sentence_0 | sentence_1 | label |
281
- |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
282
- | <code>Half-Life Estimation under the Taylor Rule ∗</code> | <code>Abstract We aim to investigate the simultaneous and interacted effects of central bank qualitative and quantitative communication on private inflation expectations, measured with survey and market-based measures. The effects of ECB inflation projections and Governing Council members’ speeches are identified through an instrumental-variables estimation using a principal component analysis to generate relevant instruments. We find that ECB projections have a positive effect on current-year forecasts, and that ECB projections and speeches are substitutes at longer horizons. Moreover, ECB speeches and the ECB rate reinforce the effect of ECB projections when they are consistent, and convey the same signal about inflationary pressures.</code> | <code>0.0</code> |
283
- | <code>Lasting Legacy on Financial Regulatory reform</code> | <code>At their meeting on October 23, the G20 finance ministers and central bank governors prioritized a number of regulatory issues for discussion at the leader’s summit. The familiar items included a ritualistic commitment to implement all reforms endorsed already by the G20 in an internationally consistent and non-discriminatory manner, such as those relating to over-the-counter derivatives, compensation practices, accounting standards and credit rating agencies.</code> | <code>1.0</code> |
284
- | <code>Tři roky po měnové krizi: rekapitulace událostí a jejich souvislostí [Three years after the exchange rate crisis: recapitulation of events and their connections]</code> | <code>We examine the European Central Bank's ad-hoc communication and explore how it informs future monetary policy decisions. Using the rich dataset of the inter-meeting verbal communication among the members of the European Central Bank's Governing Council between 2008 and 2014, we construct a measure of communication assessing its inclination towards easing, tightening or maintaining the monetary policy stance. We find that this measure provides useful additional information about future monetary policy decisions, even when we control for market-based interest rate expectations and lagged decisions. Our results also suggest that, in particular, communication shortly before monetary policy meetings, related to unconventional measures and/or by the ECB President explain the future ECB rate changes well. Overall, these results point to the importance of transparency in understanding the future course of monetary policy.</code> | <code>0.0</code> |
285
- * Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
286
- ```json
287
- {
288
- "loss_fct": "torch.nn.modules.loss.MSELoss"
289
- }
290
- ```
291
-
292
- ### Training Hyperparameters
293
- #### Non-Default Hyperparameters
294
-
295
- - `per_device_train_batch_size`: 16
296
- - `per_device_eval_batch_size`: 16
297
- - `num_train_epochs`: 1
298
- - `multi_dataset_batch_sampler`: round_robin
299
-
300
- #### All Hyperparameters
301
- <details><summary>Click to expand</summary>
302
-
303
- - `overwrite_output_dir`: False
304
- - `do_predict`: False
305
- - `eval_strategy`: no
306
- - `prediction_loss_only`: True
307
- - `per_device_train_batch_size`: 16
308
- - `per_device_eval_batch_size`: 16
309
- - `per_gpu_train_batch_size`: None
310
- - `per_gpu_eval_batch_size`: None
311
- - `gradient_accumulation_steps`: 1
312
- - `eval_accumulation_steps`: None
313
- - `torch_empty_cache_steps`: None
314
- - `learning_rate`: 5e-05
315
- - `weight_decay`: 0.0
316
- - `adam_beta1`: 0.9
317
- - `adam_beta2`: 0.999
318
- - `adam_epsilon`: 1e-08
319
- - `max_grad_norm`: 1
320
- - `num_train_epochs`: 1
321
- - `max_steps`: -1
322
- - `lr_scheduler_type`: linear
323
- - `lr_scheduler_kwargs`: {}
324
- - `warmup_ratio`: 0.0
325
- - `warmup_steps`: 0
326
- - `log_level`: passive
327
- - `log_level_replica`: warning
328
- - `log_on_each_node`: True
329
- - `logging_nan_inf_filter`: True
330
- - `save_safetensors`: True
331
- - `save_on_each_node`: False
332
- - `save_only_model`: False
333
- - `restore_callback_states_from_checkpoint`: False
334
- - `no_cuda`: False
335
- - `use_cpu`: False
336
- - `use_mps_device`: False
337
- - `seed`: 42
338
- - `data_seed`: None
339
- - `jit_mode_eval`: False
340
- - `use_ipex`: False
341
- - `bf16`: False
342
- - `fp16`: False
343
- - `fp16_opt_level`: O1
344
- - `half_precision_backend`: auto
345
- - `bf16_full_eval`: False
346
- - `fp16_full_eval`: False
347
- - `tf32`: None
348
- - `local_rank`: 0
349
- - `ddp_backend`: None
350
- - `tpu_num_cores`: None
351
- - `tpu_metrics_debug`: False
352
- - `debug`: []
353
- - `dataloader_drop_last`: False
354
- - `dataloader_num_workers`: 0
355
- - `dataloader_prefetch_factor`: None
356
- - `past_index`: -1
357
- - `disable_tqdm`: False
358
- - `remove_unused_columns`: True
359
- - `label_names`: None
360
- - `load_best_model_at_end`: False
361
- - `ignore_data_skip`: False
362
- - `fsdp`: []
363
- - `fsdp_min_num_params`: 0
364
- - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
365
- - `fsdp_transformer_layer_cls_to_wrap`: None
366
- - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
367
- - `deepspeed`: None
368
- - `label_smoothing_factor`: 0.0
369
- - `optim`: adamw_torch
370
- - `optim_args`: None
371
- - `adafactor`: False
372
- - `group_by_length`: False
373
- - `length_column_name`: length
374
- - `ddp_find_unused_parameters`: None
375
- - `ddp_bucket_cap_mb`: None
376
- - `ddp_broadcast_buffers`: False
377
- - `dataloader_pin_memory`: True
378
- - `dataloader_persistent_workers`: False
379
- - `skip_memory_metrics`: True
380
- - `use_legacy_prediction_loop`: False
381
- - `push_to_hub`: False
382
- - `resume_from_checkpoint`: None
383
- - `hub_model_id`: None
384
- - `hub_strategy`: every_save
385
- - `hub_private_repo`: None
386
- - `hub_always_push`: False
387
- - `gradient_checkpointing`: False
388
- - `gradient_checkpointing_kwargs`: None
389
- - `include_inputs_for_metrics`: False
390
- - `include_for_metrics`: []
391
- - `eval_do_concat_batches`: True
392
- - `fp16_backend`: auto
393
- - `push_to_hub_model_id`: None
394
- - `push_to_hub_organization`: None
395
- - `mp_parameters`:
396
- - `auto_find_batch_size`: False
397
- - `full_determinism`: False
398
- - `torchdynamo`: None
399
- - `ray_scope`: last
400
- - `ddp_timeout`: 1800
401
- - `torch_compile`: False
402
- - `torch_compile_backend`: None
403
- - `torch_compile_mode`: None
404
- - `include_tokens_per_second`: False
405
- - `include_num_input_tokens_seen`: False
406
- - `neftune_noise_alpha`: None
407
- - `optim_target_modules`: None
408
- - `batch_eval_metrics`: False
409
- - `eval_on_start`: False
410
- - `use_liger_kernel`: False
411
- - `eval_use_gather_object`: False
412
- - `average_tokens_across_devices`: False
413
- - `prompts`: None
414
- - `batch_sampler`: batch_sampler
415
- - `multi_dataset_batch_sampler`: round_robin
416
-
417
- </details>
418
-
419
- ### Training Logs
420
- | Epoch | Step | Training Loss |
421
- |:------:|:----:|:-------------:|
422
- | 0.2755 | 500 | 0.0844 |
423
- | 0.5510 | 1000 | 0.0667 |
424
- | 0.8264 | 1500 | 0.0607 |
425
 
 
426
 
427
- ### Framework Versions
428
- - Python: 3.11.13
429
- - Sentence Transformers: 4.1.0
430
- - Transformers: 4.52.4
431
- - PyTorch: 2.7.1+cu126
432
- - Accelerate: 1.7.0
433
- - Datasets: 2.14.4
434
- - Tokenizers: 0.21.1
435
 
436
- ## Citation
437
 
438
- ### BibTeX
439
 
440
- #### Sentence Transformers
441
- ```bibtex
442
- @inproceedings{reimers-2019-sentence-bert,
443
- title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
444
- author = "Reimers, Nils and Gurevych, Iryna",
445
- booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
446
- month = "11",
447
- year = "2019",
448
- publisher = "Association for Computational Linguistics",
449
- url = "https://arxiv.org/abs/1908.10084",
450
- }
451
- ```
452
 
453
- <!--
454
- ## Glossary
455
 
456
- *Clearly define terms in order to be accessible across audiences.*
457
- -->
458
 
459
- <!--
460
- ## Model Card Authors
461
 
462
- *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
463
- -->
464
 
465
- <!--
466
- ## Model Card Contact
467
 
468
- *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
469
- -->
 
1
  ---
2
+ license: mit
3
  tags:
4
  - sentence-transformers
5
  - sentence-similarity
6
+ - central-bank
7
+ - semantic-textual-similarity
8
  - feature-extraction
9
+ - dataset:sentence-transformers/s2orc
10
+ - dataset:allenai/peS2o
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  library_name: sentence-transformers
12
+ pipeline_tag: sentence-similarity
13
+ base_model: BAAI/bge-large-en-v1.5
14
+ language: en
15
+ datasets:
16
+ - allenai/peS2o
17
+ - sentence-transformers/s2orc
18
  ---
19
 
20
+ # bge-centralbank
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
+ **bge-centralbank** is a domain-adapted Sentence Transformer developed to assess semantic similarity in central bank-related texts. It is based on [`BAAI/bge-large-en-v1.5`](https://huggingface.co/BAAI/bge-large-en-v1.5), and adapted through both unsupervised and supervised training.
23
 
24
+ ## Training Setup
 
 
 
 
 
 
 
25
 
26
+ ### 1. Pretraining
27
 
28
+ The model was first pretrained using TSDAE on 177,842 title–abstract pairs drawn from the `peS2o` dataset — a lightweight and pre-cleaned subset of the full S2ORC corpus. From this dataset, a domain-specific corpus was constructed by filtering on keywords relevant to macroeconomics, monetary policy, and financial markets. This step enables the model to better capture the language structures and terminology common in central bank literature.
29
 
30
+ ### 2. Supervised Fine-tuning
 
 
 
 
 
 
 
 
 
 
 
31
 
32
+ Fine-tuning was conducted using a subset of the `sentence-transformers/s2orc` dataset. All **title–abstract pairs** were selected where the abstract contained the term *central bank*, resulting in 15,513 positive examples (label = 1). Each pair represents a real paper, with a matching title and abstract.
 
33
 
34
+ An equal number of negative examples (label = 0) was generated by randomly mismatching titles and abstracts from unrelated papers.
 
35
 
36
+ A separate validation set of 2,000 title–abstract pairs (1,000 positive, 1,000 negative) was held out during training. The remaining 29,026 examples were used for supervised training with `CosineSimilarityLoss` using the `sentence-transformers` framework.
 
37
 
38
+ ## Evaluation
 
39
 
40
+ The model was evaluated on the held-out validation set. Compared to the base model, `bge-centralbank` achieved a substantially lower average similarity score on negative pairs (from **0.6230** to **0.1177**), showing improved ability to distinguish semantically unrelated text. It also achieved a higher **point-biserial correlation** (from **0.7933** to **0.9025**), indicating better alignment with binary STS labels.
 
41
 
42
+ ---